DEVELOPMENTAL TYROSINE KINASES AND THEIR LIGANDS
FIELD OF THE INVENTION 

The present invention generally relates to protein tyrosine kinase receptors widely 
expressed by early cells of the haematopoietic system, by cells of the neuronal 
system in brain tissue, and in testis, ligands for such receptors and nucleic acid 
molecules encoding such receptors. 

BACKGROUND OF THE INVENTION 

There are several parallels between the development of the haematopoietic and 
neuronal systems. In particular, the presence of regulatory protein molecules 
termed growth factors which recognise and bind to specific cell membrane 
receptors is a common feature of these two systems. It is possible that shared 
families of receptors exist that are expressed in both early haematopoietic and 
neuronal stem cells. In turn, there may be a family of proteins which bind these 
receptors and function as stem cell growth factors. 

The current view of vertebrate haematopoietic ontogeny holds that a succession of 
pluripotential stem cell migrations originate in the yolk sac blood islands, initially 
invade the hepatic rudiment, and then the spleen and bone marrow. From the 
bone marrow, a limited number of multipotential stem cells are laid down during 
embryogenesis that give rise to a much larger population of developmentally 
restricted progenitor cells, and ultimately produce the mature cells of at least eight 
cell lineages. The cells of these lineages are classified as red and white blood 
cells. The white blood cells contain the mature cells of the lymphoid and myeloid 
systems. Lymphoid cells contain T and B lymphocytes and are derived from pre-T 
and pre-B cells, respectively. The myeloid system comprises several cell types 
known as granulocytes, platelets, monocytes, macrophages, and megakaryocytes. 
The granulocytes are further divided into neutrophils, eosinophils, basophils and 
mast cells (see review by Metcalf D., The Molecular Control of Blood Cells,
Harvard Univ. Press, 1988).

The haematopoietic system functions by precisely controlling the production of
cells in the various lineages. Totipotent haematopoietic stem cells have the ability 
to both self-renew and differentiate. Stem cells undergo a series of differentiation 
steps leading to increasingly lineage-restricted progenitor cells. The more mature 
progenitor cells are restricted to production of only one or two lineages. For 
some time the colony-forming unit-spleen (CFU-S) assay served to operationally 
define all stem cells. Recent evidence demonstrates heterogeneity within CFU-S, 
with only a small fraction of CFU-S capable of contributing to long-term 
repopulation following ablation of the haematopoietic system by irradiation. It is
recognised that stem and progenitor populations are not discrete, but represent a 
continuum of cells from those of high self-renewal capacity and low probability of 
differentiation to those cells with low self-renewal probability and high 
commitment to differentiation. When long-term haematopoiesis is investigated at 
the clonal level, studies have shown that single stem cell clones are sufficient to 
maintain haematopoiesis over the lifetime of an animal. 

The development of the mammalian embryo is governed by interactions between 
different embryonic cell populations. This process is manifest at the cellular level 
in the precise temporal and spatial control of proliferation, differentiation and 
migration. The coordination of these processes may be achieved in part by the 
action of a family of regulatory molecules termed growth factors. Growth factors 
can evoke diverse responses in different cell types and may interact with one 
another synergistically or antagonistically. Their action is complex and most of 
our current understanding results from in vitro experiments. In most instances, 
haematopoietic growth factor actions defined in vitro have been confirmed in vivo. 
In haematopoiesis, some growth factors are lineage-restricted in their action. 
These include erythropoietin, which acts predominantly on red cell development, and
granulocyte colony-stimulating factor, whose predominant action is on granulocytes.
At the other end of the spectrum is interleukin-3 which can act on several target 
cells such as granulocyte-macrophage progenitors, eosinophils, megakaryocytes, 
erythroid cells and mast cells. There are no known growth factors that function 
exclusively on haematopoietic stem cells.

The ligand for c-kit, termed stem cell factor, kit ligand or mast cell growth factor,
is the product of the Steel (Sl) locus in mice. The factor acts either alone or
synergistically with several known growth factors on primitive stem cells. It is 
believed that this factor is essential for the development of early haematopoietic 
stem cells, and cells of the erythroid and mast cell lineages. 

The stem cell compartment may be viewed as a finely tuned balance between the 
action of inhibitors and the stimulatory role of cytokines. As with other stem cell 
systems, haematopoietic stem cells are distributed in a defined spatial manner 
within adult bones and not in a random, homogeneous mixture of interacting cell 
types. A concept that underlies the regulation of haematopoietic stem cell 
development is that these cells reside within a specialised microenvironment, 
where the regulatory signals act locally. Stromal cells constitute the bone marrow 
microenvironment. 

Embryonic stem cells are permanent cell lines established directly from the inner 
cell mass of the preimplantation mouse embryo. They retain the ability to 
participate in normal embryonic development and, following introduction into the 
blastocyst, generate chimaeric animals that are mosaic in all tissues. Embryonic 
stem cells are increasingly being used as cellular vectors for experimentally 
manipulating the mouse genome. 

Doetschman has demonstrated that embryonic stem cells can generate primitive 
erythroid cells in culture (Doetschman et al., J. Embryol. Exp. Morphol. 87,
27-45, (1985)). This result was achieved by inducing embryonic stem cells to form 
cystic embryoid bodies in the presence of preselected batches of human cord 
serum. 

In addition to haematopoietic cell development, it has been noted that neurons 
also arise in differentiating embryonic stem cells. Haematopoietic differentiation 
in this system occurred infrequently and slowly, and was not synchronized. Recently a
modified system enabling the differentiation of embryonic stem cells in
methylcellulose into multiple haematopoietic lineages has been described by Wiles
and Keller (Development 111, 259-267 (1991)). Using this approach,
macrophages, neutrophils, erythroid cells and mast cells develop in a synchronous
manner with high frequency in the absence of human cord serum. The
development of haematopoiesis from embryonic stem cells in methylcellulose
cultures parallels the onset of haematopoiesis in the developing mouse embryo.

An important objective in the field of developmental biology is the identification 
of genes, the products of which mediate regulatory signals required during 
embryogenesis. There is compelling evidence that genes encoding receptor
tyrosine kinases (RTKs) are involved in early development in vertebrates. The
general family of protein tyrosine kinases can be recognised by the presence of
several conserved amino acid regions in the catalytic domain. These conserved
regions have been summarised by Hanks et al. (Science 241, 42-52 (1988)) and by
Wilks et al. (Proc. Natl. Acad. Sci. USA 86, 1603-1607 (1989)). The receptor for
macrophage colony-stimulating factor, c-fms, which is important in myeloid cell
differentiation and placental development, is an RTK. The mouse developmental
mutation W has been shown to involve an RTK. The W locus encodes the c-kit
RTK and affects the proliferative and/or migratory properties of primordial germ
cells, melanoblasts and haematopoietic stem cells. A recently described RTK
termed flk-2, which is related to c-kit, has been isolated using the polymerase
chain reaction (PCR) with oligonucleotides to conserved kinase domain motifs.
Messenger RNA transcripts for flk-2 are expressed in populations enriched for 
stem cells and primitive uncommitted progenitor cells, and are absent in mature 
haematopoietic cells (see Matthews et al. Cell 65, 1143-1152, (1991)). 

Additional receptor tyrosine kinases expressed on pluripotential haematopoietic 
stem cells are needed to facilitate the in vitro growth of stem cells. The nucleic 
acid molecules that encode receptor tyrosine kinases expressed by pluripotential 
stem cells are needed to produce recombinant receptors and ligands. 

In vertebrate development, the cells whose descendants give rise to the nervous 
system are first identified as the neural ectoderm. This forms a tube-like structure 
beneath the surface of the ectoderm. Following closure of the neural tube some
precursor cells detach from the apical neural tube and form a transient structure
called the neural crest. These cells rapidly disperse into the embryo along
complex migratory pathways. The proliferating neural crest cells also invade
developing tissues such as the skin, gut, and the adrenal gland to form
differentiated cell populations within these tissues, e.g. melanocytes, enteric
neurons and adrenal medullary chromaffin cells.

The diversity of cell types derived from the neural crest poses the problem of how 
uncommitted embryonic cells acquire particular developmental fates. There are 
strong parallels between neural crest cell lineage diversification and the process of 
haematopoiesis. It has been proposed that the earliest neural crest cells should be 
multipotent and may be capable of self-renewal. Secondly, it should be possible to
identify committed progenitors that proliferate symmetrically and are restricted to 
distinct sublineages and thirdly, there should exist factors which influence the 
proliferation and/or differentiation of specific types of progenitors (see Anderson 
Neuron 3, 1-12, (1989)). 

Soluble proteins variously termed neurotrophic, growth, and neuronal 
differentiation factors have been identified that influence the developmental 
growth, maintenance of function, and plasticity of neuronal populations. These 
factors have been implicated in the proliferation and differentiation of neurons 
during embryonic development and in their growth and survival in the adult 
nervous system. There are a growing number of neurotrophic factors, including 
nerve growth factor, brain-derived neurotrophic factor, neurotrophin-3 and 
neurotrophin-4. These molecules constitute a closely related family sharing at 
least 60% amino acid identity. If the parallel to the haematopoietic system is 
extended, the range and complexity of cells derived from the neural crest implies 
that there will be a large number of protein regulators which control this system. 

Two different types of receptors have been demonstrated for neurotrophins. One 
group of these receptors are transmembrane glycoproteins with tyrosine kinase 
activity encoded by members of the trk protooncogene family. It would therefore
be important to isolate additional receptor tyrosine kinases from developing
systems such as embryonic stem cells which contain neurons. Ligands for such 
receptors are required to act inter alia as neurotrophic factors. Nucleic acid 
molecules encoding the receptors and ligands are needed to produce recombinant 
receptors and ligands. 

It is the object of the present invention to go some way towards fulfilling the 
above objectives or at least to provide the public with a useful choice. 

SUMMARY OF THE INVENTION 

The present invention has a number of aspects. In a first aspect, the invention 
provides a mammalian receptor tyrosine kinase which is a developmental tyrosine 
kinase (Dtk) and which is expressed in multipotential haematopoietic cells, in 
embryonic stem cells, in brain tissue and in testis, but which is not expressed in 
mature lineage-restricted haematopoietic cells. 

In a further aspect, the invention provides an extracellular receptor domain of a 
receptor tyrosine kinase as defined above. In preferred embodiments, this 
extracellular receptor domain can be bound to a support, or can be in a soluble 
form. 

In still a further aspect, the invention provides a nucleic acid molecule encoding a 
receptor tyrosine kinase or extracellular receptor domain as defined above. This 
nucleic acid molecule is preferably DNA. 

In yet a further aspect, the invention provides a vector including a DNA molecule 
as defined above. 

In still a further aspect, the invention provides a method of producing a receptor 
tyrosine kinase comprising the steps of: 

(a) culturing a host cell which has been transformed or transfected with
a vector as defined above to express the encoded receptor tyrosine
kinase or extracellular receptor domain; and

(b) recovering the expressed receptor tyrosine kinase.

As yet an additional aspect, the invention provides a ligand that binds to a 
receptor tyrosine kinase as defined above. 

The ligand can take two forms. In one form, the ligand stimulates the 
proliferation, differentiation and/or survival of cells which express a receptor 
tyrosine kinase as defined above (a stimulant ligand).

In the second form, the ligand is antagonistic and at least partially blocks or
inhibits the function of a receptor tyrosine kinase as defined above through
binding to said receptor (an antagonistic ligand).

In another aspect, the invention provides a method of stimulating the
proliferation, differentiation and/or survival of a cell expressing a receptor
tyrosine kinase as defined above comprising contacting the cell with a stimulant
ligand as defined above.

In yet a further aspect, the invention provides a method of inhibiting the function
of a receptor tyrosine kinase as defined above comprising contacting the receptor
with an antagonistic ligand as defined above.



In still another aspect, the invention provides a method of treating a disease,
syndrome or condition caused or mediated by an excess of a ligand as defined
above comprising the step of contacting said excess of said ligand with an effective
amount of a receptor tyrosine kinase or an extracellular receptor domain as
defined above.

In another aspect, the invention provides a method of extracting a ligand from a
medium which may contain said ligand comprising the step of contacting said
medium with a receptor tyrosine kinase or with an extracellular receptor domain
as defined above.

The invention also provides a method of isolating ligand(s) from a medium which
may contain said ligand(s), comprising the steps of:

(a) contacting said medium with an effective amount of a receptor 
tyrosine kinase or an extracellular domain as defined above; 

(b) detecting which ligand(s) bind; and 

(c) isolating such bound ligand(s). 

While the invention broadly consists in the foregoing, it should be appreciated that 
it also includes the more specific embodiments detailed in the following 
description: 

DESCRIPTION OF THE FIGURES 

Figure 1 shows expression of murine Dtk in embryonic stem (ES) cells and
embryoid bodies. RNase protection analysis was performed on total RNA (10 μg)
from ESD3 ES cells growing in Leukaemia Inhibitory Factor (LIF) (day 0), or
from ES cells maintained in the absence of LIF that were differentiating and
developing into cystic embryoid bodies (days 2 to 18). As a control, tRNA (10 μg)
was also used. The markers were pBR322 digested with Msp I. The size of the
free murine Dtk probe was 229 nt. A fully protected fragment representing the
presence of murine Dtk transcripts was 187 nt in length. The free β-actin
protected fragment is shown in each lane as an RNA loading control.

Figure 2 shows expression of murine Dtk in embryonic mouse tissues. RNase 
protection analysis was performed on total RNA (10 μg) isolated from E14.5
embryonic tissues of the C57BL/6J mouse strain. Details of the markers, probes 
and controls are as described for Figure 1. 

Figures 3 and 4 show expression of murine Dtk in adult mouse tissues. RNase 
protection analysis was performed on total RNA (10 μg) isolated from the various
tissues of adult C57BL/6J mice. Details of the markers, probes and controls are 
as described for Figure 1. 



WO 94/19463    PCT/NZ94/00009



Figure 5 shows expression of murine Dtk in murine cell lines. The most abundant
expression is in the multipotential cell lines FDC-P1 and DA2, and the mast cell
line P815. The majority of other cell lines are lineage-committed, mature
haematopoietic cell lines, which have very limited murine Dtk expression. The
NIH 3T3 cell line is derived from embryonic fibroblasts and C2C12 is a myoblast
cell line.



Figure 6 shows the cDNA and amino acid sequence of murine Dtk. 

Figure 7 shows the cDNA and amino acid sequence of human Dtk.

DETAILED DESCRIPTION OF THE INVENTION 
A Receptors 

In a first aspect, this invention provides a mammalian receptor protein tyrosine
kinase (PTK). The mammal in which the PTK exists may be any mammal, such
as a mouse, rat, rabbit or human.



Members of the PTK family are recognised by the conserved amino acid regions
in the catalytic domains. Examples of PTK consensus sequences have been
provided by Hanks et al. (Science 241, 42-52 (1988), especially Figure 1 starting at
page 46) and by Wilks et al. (Proc. Natl. Acad. Sci. USA 86, 1603-1607 (1989),
especially Figure 2 on page 1605).

Hanks et al. identify eleven catalytic subdomains containing PTK consensus
residues and sequences. The PTKs of the present invention contain most or all of
these consensus residues and sequences.



As indicated above, the PTKs of the invention are receptor PTKs and so are also
generally referred to as RTKs. Further, as the applicants believe that the RTKs
of the invention are involved in mammalian cell development, they are specifically
referred to hereinafter as developmental tyrosine kinases (Dtks).

The Dtks of the invention are transmembrane receptor tyrosine kinases whose
extracellular domains contain two immunoglobulin-like motifs followed by two
fibronectin-type III repeats. RTKs of this structure (Axl (Ufo, Ark)) are already
known (Janssen et al., Oncogene 6, 2113-2120 (1991); O'Bryan et al., Mol. Cell.
Biol. 11, 5016-5031 (1991); Rescigno et al., Oncogene 6, 1909-1913 (1991); Faust et
al., Oncogene 7, 1287-1293 (1992)). The Dtks of the invention are however
distinguished from those RTKs having the equivalently structured extracellular
domains by their potential function based upon their distribution within the
mammalian body.

With regard to this latter feature of the Dtks of the invention, the applicants have 
conducted experiments to determine the range of cells in which the developmental 
tyrosine kinases of the invention are expressed. These experiments were 
specifically performed in relation to murine Dtk but are believed to be illustrative 
of the expression of all mammalian Dtks of the invention.

A.1 Analysis of Murine Dtk expression 

The expression of murine Dtk in a range of embryonic and adult mouse tissues
was analyzed by ribonuclease protection analysis, using a probe that encompassed
sequences encoding the membrane-proximal portion of the extracellular domain of
the receptor.

Materials and Methods 

1. Embryonic stem cell culture 

The ESD3 embryonic cell line (Doetschman et al., J. Embryol. Exp. Morphol. 87,
27-45 (1985)) was maintained on gelatin-coated dishes in Dulbecco's-modified
Eagle's medium (DMEM) with additives according to established procedures
(Hogan et al., Cold Spring Harbour Laboratory, 1-332 (1986)), in the presence of
LIF. Cystic embryoid bodies were established following collagenase treatment of
the ES cells and subsequent suspension culture in bacteriological-grade petri
dishes in DMEM with additives in the absence of LIF (Wiles and Keller,
Development 111, 259-267 (1991)).

2. Fetal liver haematopoietic stem cell enrichment

Low density haematopoietic stem cells were isolated from an E14.5 fetal liver cell 
suspension using equilibrium density centrifugation on a discontinuous 
metrizamide gradient according to the method of Visser et al., J. Exp. Med. 159,
1576-1590 (1984). Following this procedure, low density fetal liver cells (p24
< 1.078 g/cm³) were incubated for 20 minutes on ice in DMEM medium with 5
μg/10⁶ cells of AA4 monoclonal antibody (rat IgG2b; McKearn et al., Proc. Natl.
Acad. Sci. USA 82, 7414-7418 (1985)) and washed twice. This antibody has been
shown to recognise the most primitive haematopoietic stem cell in fetal liver
(Jordan et al., Cell 61, 953-963 (1990)). The AA4 labelled cells were then
incubated on ice for 20 minutes with magnetic beads conjugated with anti-rat IgG
antibody as outlined in the manufacturer's protocol (Advanced Magnetics Corp.,
Cambridge, MA). Following incubation, AA4+ cells were positively-selected on a
magnet. Stem cell enrichment was assessed by re-labelling the cells with the AA4 
antibody, followed by a second layer antibody staining with goat anti-rat 
fluorescein isothiocyanate and flow cytometric analysis on a FACS 440 (Becton 
Dickinson, San Jose, CA). 

3. RNA analysis 

RNase protection analysis was performed by hybridization of 10 μg of total RNA
to RNA probes that encoded sequences of murine Dtk and β-actin, overnight at
52°C. RNase digestion was performed with RNase T1 (1.75 μg/ml) and RNase A
(35 μg/ml) at 37°C for one hour. The reaction was stopped with proteinase K
(333 μg/ml) and SDS (0.3%). The products were run on a 6% urea/acrylamide
gel and the autoradiograph exposed at -70°C. The probe for analysis of Dtk
expression was derived from nucleotides 1158 to 1334 of the Dtk sequence, a
segment which encodes the membrane-proximal portion of the extracellular
domain, and which had been subcloned into pGEM-4Z. In an RNase protection
assay, the free probe yielded a 229 nucleotide (nt) band, and Dtk transcripts
protected a fragment of 187 nt. A riboprobe was also constructed from a Sal
I-Sma I fragment of human β-actin. The length of the free β-actin probe was 132
nt and β-actin transcripts protected a 54 nt fragment.

Results

1. Embryonic stem cells 

Figure 1 demonstrates the expression of Dtk transcripts in both totipotent ES cells 
growing in LIF (termed day 0), and in differentiating cystic embryoid bodies 
growing in the absence of LIF for up to 18 days. In this developmental system 
Dtk is expressed almost uniformly from days 0 to 18, indicated by the presence of 
a protected 187 nt band for each RNA analyzed. The two bands of approximately 
220 nt and 210 nt present in lanes for each RNA sample analyzed are also present 
in the tRNA lane and are regarded as nonspecific. Of considerable interest with 
regard to the importance of this receptor in mouse development is the 
demonstration of Dtk expression in totipotent ES cells. The ES cells from which 
RNA was extracted for day 0 analysis were selected from cultures, following 
morphological assessment by phase-contrast microscopy to confirm that they were 
undifferentiated. 

2. Embryonic tissues 

Expression of Dtk was detected in total RNA isolated from a wide range of mid- 
gestational E14.5 embryonic mouse tissues including the brain, eye, thymus, lung, 
intestine, forelimb, hindlimb and testis (Figure 2). There was limited expression 
in heart and unfractionated liver. 

Figure 2 shows enrichment of Dtk transcripts in E14.5 fetal liver low density 
AA4+ haematopoietic stem cells. Following density-gradient centrifugation and
positive selection, the cells used for RNA analysis were greater than 95% AA4+,
as assessed by flow cytometry (data not shown). Dtk expression was also detected 
in day 14.5 placenta. 

3. Adult tissues 

In contrast to the widespread expression of Dtk in embryonic tissues, the pattern 
of expression in adult tissues becomes restricted (Figures 3 and 4). Dtk transcripts 
were most abundant in brain, esophagus, bladder, testis, and ovary. In brain, 
expression of Dtk (relative to β-actin) was more abundant in adult than in
embryonic tissue. Adult tissues which contained less abundant, but detectable
transcripts were lung, and regions of the gastrointestinal tract including the
stomach and both the small and large intestine. Tissues in which Dtk transcripts
were undetectable or expressed at extremely low levels included the salivary gland,
thymus, heart, liver, skeletal muscle, kidney, spleen, bone marrow, adrenal gland
and uterus.

4. Murine cell lines 

The pattern of expression of Dtk in murine cell lines was analyzed in relation to 
the following: WEHI-3B, 416B, EL4, S03, SP2/0, P388D1, P815, FDC-P1, DA2,
FDC-P1/IL-2 ras, NIH3T3 and C2C12. The results are shown in Figure 5.

As can be seen from Figure 5, the results are consistent with those above, with the 
most abundant expression being in the multipotential cell lines FDC-P1 and DA2, 
and in mast cell line P815. Significant expression is also observed in myoblast cell
line C2C12.

In contrast, the remaining cell lines (lineage-restricted mature haematopoietic cell 
lines) show very limited murine Dtk expression. 

From this analysis, the applicants have derived the condition defining the Dtks of
the invention: they are expressed in multipotential haematopoietic cells, in
totipotent embryonic stem cells, in brain tissue and in testis, but not in mature
lineage-restricted haematopoietic cells.

For the purpose of this specification, a multipotential haematopoietic cell is an
early haematopoietic cell. Examples of multipotential haematopoietic cells
include multipotential factor-dependent cells that have the capacity to proliferate
and differentiate into mature haematopoietic cells. In contrast, a mature
haematopoietic cell is non self-renewing and has limited ability to give rise to
multiple cell lineages. Mature lineage-restricted haematopoietic cells, for the
purposes of this specification, are therefore represented by haematopoietic cell
lines of the T or B lymphoid lineage or mature myeloid lineages.

The Dtks of the present invention may or may not be expressed in intermediate
cells poised between the state of being multipotential and mature.

In terms of brain tissue, the Dtks of the invention are primarily expressed in 
neuronal cells. 

In terms of testis, the Dtks are primarily expressed in the Sertoli cells. 

It will of course be appreciated by those persons skilled in this art that the 
reference to the Dtks of the invention not being expressed in mature lineage- 
restricted haematopoietic cells is in a biological context and does not mean that 
there is absolutely no expression of the Dtk in these cells. As is apparent from 
Figures 1 to 5, what is meant by the phrase "not expressed in mature lineage-
restricted haematopoietic cells" is that there is no significant expression of the Dtk
in the cell, i.e. that expression is either undetectable or at an extremely low level. 

The restricted expression of the Dtks of the invention to cells representative of 
early multipotential cells, with substantial absence of expression in lineage- 
restricted cells such as T or B lymphocytes, is consistent with this receptor 
functioning and transducing signals from the microenvironment to the 
haematopoietic stem cell compartment. The expression of the Dtk in embryonic 
stem cells and in some fetal tissues such as brain is also consistent with this 
receptor and its ligand having a functional role in the specification of cell lineages 
during embryonic development, including neuronal development. Furthermore, 
the receptor and its ligand are likely to have a role in the maintenance of function
and plasticity in neuronal populations or their derivatives. Finally, the expression 
of the receptor in adult brain is consistent with the receptor and its ligand having 
a role in the growth and survival of neurons in the adult nervous system. 

The embryonic stem cell and haematopoietic multipotential cell line mRNA for 
Dtk migrates relative to 28S and 18S ribosomal bands on formaldehyde agarose 
gels at approximately 4.2 Kb. In adult brain tissues, Dtk mRNA migrates at 
approximately 4.2 Kb.

The Dtks of the invention can usefully be provided in a number of different forms.
These include the Dtk itself, the "mature" form of the Dtk, and the extracellular 
receptor domain of the Dtk. 

The "mature" form of the Dtk of the invention is the Dtk less its native amino- 
terminus leader or signal sequence, whereas the extracellular receptor domain is 
the Dtk lacking the transmembrane region and catalytic domain. 

The extracellular domain may be identified through commonly recognised criteria 
of extracellular amino acid sequences. The determination of appropriate criteria 
is known to those skilled in the art, and has been described, for example, by Hopp
et al., Proc. Natl. Acad. Sci. USA 78, 3824-3828 (1981); Kyte et al., J. Mol. Biol.
157, 105-132 (1982); Emini, J. Virol. 55, 836-839 (1985); Jameson et al., CABIOS
4, 181-186 (1988); and Karplus et al., Naturwissenschaften 72, 212-213 (1985).
Amino acid domains predicted by these criteria to be surface exposed are 
characteristic of extracellular domains. 
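As an illustration only, one of the cited criteria (the hydropathy index of Kyte et al.) can be sketched as a sliding-window average over a protein sequence. The window size of 7, the threshold of 0.0, the function names and the test sequence below are assumptions chosen for the example; they are not taken from this specification.

```python
# Kyte-Doolittle hydropathy index per residue (Kyte et al., J. Mol. Biol. 157, 105-132 (1982)).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(seq, window=7):
    """Return the mean hydropathy of every full window along seq."""
    if window < 1 or window > len(seq):
        raise ValueError("window must be between 1 and the sequence length")
    scores = [KD[aa] for aa in seq.upper()]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


def hydrophilic_windows(seq, window=7, threshold=0.0):
    """Return 0-based start indices of windows whose mean hydropathy is
    below threshold (hypothetical cut-off for 'likely surface exposed')."""
    profile = hydropathy_profile(seq, window)
    return [i for i, value in enumerate(profile) if value < threshold]


# Placeholder sequence: a lysine-rich stretch followed by a leucine-rich stretch.
example = "KKKKKKKLLLLLLL"
exposed = hydrophilic_windows(example)
```

Windows with a strongly positive mean, such as a run of leucines, are the kind of hydrophobic stretch expected of a membrane-spanning region rather than an extracellular one.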

The Dtks of the invention or their extracellular receptor domains may be prepared 
by methods known in the art. Such methods include protein synthesis from 
individual amino acids as described by Stewart and Young in "Solid Phase Peptide
Synthesis", Second Edition, Pierce Chemical Company (1984). It is however 
preferred that the Dtks and/or their extracellular receptor domains be prepared 
by recombinant methods as will be detailed hereinafter. 

A.2 Specific Dtks of the Invention 
A.2.1 Murine Dtk 

As is indicated above, a first Dtk of the invention, murine Dtk, has been identified 
in certain tissues of the mouse. Murine Dtk generally has the nucleic acid and 
deduced amino acid sequence shown in Figure 6. Figure 6 represents individual 
amino acid residues as single letters as follows:

Amino Acid                     Three-letter      One-letter
                               abbreviation      symbol

Alanine                        Ala               A
Arginine                       Arg               R
Asparagine                     Asn               N
Aspartic acid                  Asp               D
Asparagine or aspartic acid    Asx               B
Cysteine                       Cys               C
Glutamine                      Gln               Q
Glutamic acid                  Glu               E
Glutamine or glutamic acid     Glx               Z
Glycine                        Gly               G
Histidine                      His               H
Isoleucine                     Ile               I
Leucine                        Leu               L
Lysine                         Lys               K
Methionine                     Met               M
Phenylalanine                  Phe               F
Proline                        Pro               P
Serine                         Ser               S
Threonine                      Thr               T
Tryptophan                     Trp               W
Tyrosine                       Tyr               Y
Valine                         Val               V
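For readers handling the sequences electronically, the abbreviations tabulated here can be held as a simple lookup. The sketch below is illustrative; the names THREE_TO_ONE and to_one_letter are hypothetical and the three-residue input is a placeholder, not part of the Dtk sequence.

```python
# Three-letter to one-letter amino acid codes, including the two
# ambiguity codes (Asx, Glx).
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Asx": "B",
    "Cys": "C", "Gln": "Q", "Glu": "E", "Glx": "Z", "Gly": "G",
    "His": "H", "Ile": "I", "Leu": "L", "Lys": "K", "Met": "M",
    "Phe": "F", "Pro": "P", "Ser": "S", "Thr": "T", "Trp": "W",
    "Tyr": "Y", "Val": "V",
}


def to_one_letter(residues):
    """Convert an iterable of three-letter codes (any letter case)
    into a one-letter sequence string."""
    return "".join(THREE_TO_ONE[r.capitalize()] for r in residues)


short_peptide = to_one_letter(["Met", "Ala", "LEU"])
```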



Details of the sequence of murine Dtk are as follows. 



Sequence Analysis of the Murine Dtk

Figure 6 shows the 3.919 Kb nucleotide and deduced amino acid sequence for 
murine Dtk from murine neonatal brain. Within the 5' region, a potential site for 
translation initiation (-GGAGCATGGGG-) is found within a good Kozak 
consensus sequence. The first methionine initiates an open reading frame of 874
amino acids. Using the method of von Heijne (Sequence Analysis in Molecular
Biology, 113-117, San Diego, Academic Press (1987)), the signal cleavage site is
predicted to be between alanine 24 and alanine 25, which specifies a 24 amino
acid hydrophobic leader sequence and a mature receptor tyrosine kinase protein
of 850 amino acids. Amino acids AGLK to PHSR form a 386 amino acid
extracellular domain. A 25 amino acid hydrophobic region from TSWV to LILL
is consistent with that of a transmembrane domain (Fasman and Gilbert, Trends
Biochem. Sci. 15, 89-92 (1990)), while the remaining amino acids ending HSSC
comprise the cytoplasmic domain.
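
The residue counts quoted for murine Dtk can be cross-checked with simple arithmetic. The following is a minimal sketch in Python; the variable and function names are illustrative only, and the 439-residue cytoplasmic figure is the one quoted in the kinase-domain analysis of this section.

```python
# Residue counts as quoted in the text for murine Dtk.
OPEN_READING_FRAME = 874   # amino acids encoded from the first methionine
SIGNAL_PEPTIDE = 24        # hydrophobic leader removed on cleavage
DOMAINS = {
    "extracellular": 386,  # AGLK to PHSR
    "transmembrane": 25,   # TSWV to LILL
    "cytoplasmic": 439,    # remainder, ending HSSC
}


def mature_length(orf_length, signal_length):
    """Length of the mature protein once the leader is cleaved."""
    return orf_length - signal_length


mature = mature_length(OPEN_READING_FRAME, SIGNAL_PEPTIDE)
domain_total = sum(DOMAINS.values())
print(mature, domain_total)
```

Both values agree with the 850 amino acid mature protein stated above, so the three domain boundaries account for every residue of the mature receptor.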

The extracellular domain of murine Dtk contains eight consensus sites (NxT or NxS)
for N-linked glycosylation, predicting that the mature Dtk protein is glycosylated.
Within the extracellular domain, two repeating protein motifs are identifiable.
Using the predictive methods of Williams and Barclay, Ann. Rev. Immunol. 6,
381-405 (1988), two C-type immunoglobulin (Ig)-like domains are present from
amino acids KLMG to GEET (Ig-like domain I) and FFTV to NIKG (Ig-like
domain II). The first Ig domain has a structure similar to a C1 domain, while the
second Ig domain is more C2-like. Based on the analysis of Petersen et al., Proc.
Natl. Acad. Sci. USA 80, 137-141 (1983), there are two fibronectin type III
modules present from amino acids PPAA to PYGD (domain I) and from amino
acids PFQT to SHDH (domain II).
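
Consensus sites of the NxT or NxS form can be located in a protein sequence by a simple pattern search. The sketch below is illustrative only: the toy fragment is hypothetical and is not taken from Figure 6, and the exclusion of proline at the middle position is a commonly applied convention assumed here rather than something stated in this specification.

```python
import re

# N-x-T or N-x-S. The lookahead lets overlapping sites be reported.
# Excluding proline at position x is an assumption, not part of the text.
SEQUON = re.compile(r"(?=(N[^P][ST]))")


def find_sequons(protein):
    """Return (1-based position, tripeptide) for each candidate site."""
    return [(m.start() + 1, m.group(1))
            for m in SEQUON.finditer(protein.upper())]


# Hypothetical toy fragment used only to exercise the function.
toy_fragment = "MKNGTALNPSQQNVSANNTT"
sites = find_sequons(toy_fragment)
for position, tripeptide in sites:
    print(position, tripeptide)
```

Applied to the extracellular domain sequence of Figure 6, a search of this kind would be expected to report the consensus sites referred to above, although the exact count depends on whether the proline rule is applied.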

Analysis of the 439 amino acid cytoplasmic domain sequence of murine Dtk shows
many of the motifs which are highly conserved within the catalytic kinase domain
of protein tyrosine kinases (Hanks et al., Science 241, 42-52 (1988)). The motifs
GKGEFG and VAVK, which function as the Mg2+-ATP binding site (Ullrich and
Schlessinger, Cell 61, 203-212 (1990); Cantley et al., Cell 64, 281-302 (1991)), are
observed at the start of the kinase domain. Further towards the carboxy-terminus
of Dtk other conserved kinase motifs are identifiable, including the motif
IHRDLAARN, the DFG triplet motif and the motifs KWLALES and DVWAFG.
Alignment of the kinase domain of Dtk with other protein tyrosine kinase domains,
including that of Ufo, suggests there is a kinase insert region specified by the
amino acids RIGENPFN. There are 12 tyrosine residues within the cytoplasmic
domain of Dtk, including two residues located near the C-terminus that are nested
within sequences that exhibit strong homology to Src homology 2 (SH2) domain
binding sites (Songyang et al., Cell 72, 767-778 (1993)). One of these sequences,
EEVYDLM, is a putative binding site for phosphatidylinositol 3-kinase, but lies
within the catalytic domain proper and is unlikely to be autophosphorylated. The
sequence DPLYINI fulfills criteria for either a Sem5/Grb2 binding site or a
phospholipase C-γ binding site (Songyang et al. (1993), supra), and its position in
the C-terminal tail makes it a good candidate for phosphorylation.
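
The conserved kinase motifs named above can be located in a candidate sequence by direct string search. The following is a minimal sketch; the toy sequence is hypothetical (the motifs from the text joined by arbitrary filler residues) and the function names are illustrative.

```python
KINASE_MOTIFS = ["GKGEFG", "VAVK", "IHRDLAARN", "DFG", "KWLALES", "DVWAFG"]


def locate_motifs(protein, motifs=KINASE_MOTIFS):
    """Map each motif to the 1-based positions at which it occurs."""
    hits = {}
    for motif in motifs:
        positions = []
        start = protein.find(motif)
        while start != -1:
            positions.append(start + 1)
            start = protein.find(motif, start + 1)
        hits[motif] = positions
    return hits


def count_tyrosines(protein):
    """Number of tyrosine (Y) residues in the sequence."""
    return protein.count("Y")


# Hypothetical toy sequence, not the Dtk cytoplasmic domain itself.
toy = ("AAL" + "GKGEFG" + "SS" + "VAVK" + "LLQ" + "IHRDLAARN" + "CM"
       + "DFG" + "LSR" + "KWLALES" + "T" + "DVWAFG" + "EEVYDLM")
hits = locate_motifs(toy)
print(hits, count_tyrosines(toy))
```

Run against the cytoplasmic domain of Figure 6, the same functions would give the motif positions and the tyrosine count discussed above; the order of the positions returned reflects the amino-terminal to carboxy-terminal order of the motifs.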

In specific aspects, the invention provides murine Dtk, mature murine Dtk and the
extracellular receptor domain of murine Dtk.

Murine Dtk has the amino acid sequence given as SEQ ID NO 1. 

Mature murine Dtk has the amino acid sequence given as SEQ ID NO 2.

The extracellular receptor domain of murine Dtk has the amino acid sequence 
given as SEQ ID NO 5. 

The invention also includes functional equivalents of murine Dtk, mature murine
Dtk and the extracellular receptor domain of murine Dtk as is described 
hereinafter. 

A.2.2 Human Dtk 

A second Dtk of the invention has been identified from human tissue. This 
second receptor is the human homologue of murine Dtk having all of the 
structural features of murine Dtk. 

The nucleic acid and deduced amino acid sequence for this receptor tyrosine 
kinase, hereinafter called "human Dtk", is shown in Figure 7. Sequence details are 
as follows. 

Sequence Analysis of the Human Dtk 

Figure 7 shows the 4.364 Kb nucleotide and deduced amino acid sequence for the 
human Dtk from human fetal brain. The structural features of human Dtk closely 
parallel those described for murine Dtk. The signal peptide encompasses amino
acids MGRP to ESAA. The mature protein extends from residues AGLK to
HSSC. Within the mature protein the extracellular domain is defined by residues
AGLK to PHSR, the transmembrane domain by residues TSWV to LILL, and the 
cytoplasmic domain from residues RKRR to HSSC. 

The extracellular domain contains two repeating protein motifs made up of two 
immunoglobulin domains (KLMG to GGET and FFTV to NLKG), followed by 
two fibronectin type III modules (LPAA to PYAD and PFQT to SHDR). The 
protein tyrosine kinase domain is encompassed by the amino acids LGKG to
RMEL within the cytoplasmic domain. The motifs defined within the murine 
protein tyrosine kinase domain are also identifiable within the human protein 
tyrosine kinase domain. 

Once again, in its specific aspects the invention provides different forms of the 
Dtk (human Dtk, "mature" human Dtk and the extracellular receptor domain of 
human Dtk). 

Human Dtk has the amino acid sequence given as SEQ ID NO 3. 

Mature human Dtk has the amino acid sequence given as SEQ ID NO 4. 

The extracellular receptor domain of human Dtk has the amino acid sequence 
given as SEQ ID NO 6. 

Once again, the invention further includes functional equivalents of human Dtk, 
mature human Dtk and of the extracellular receptor domain of human Dtk.

A.2.3 Other Mammalian Dtks 

In addition to the murine and human Dtks described above, the invention includes 
within its scope Dtks of other mammals. Such Dtks are the homologues of both 
murine and human Dtk and can be readily identified by those persons skilled in 
the art with reference to the characterising data given above for murine Dtk and 
human Dtk. 

By way of example, one method for identifying other Dtks of the invention
involves the formation of a DNA library from a suitable tissue source (such as 
brain) obtained from the mammal. This library can then be screened to identify 
DNA coding for homologues to murine Dtk and human Dtk as will be described 
in more detail below. 

B. Nucleic Acid Molecules Encoding the Dtks of the Invention 
In another aspect of this invention, the applicants provide nucleic acid molecules 
encoding the Dtks. These nucleic acid molecules may be DNA (isolated from
nature, synthesised or cDNA) or RNA. Most often, the nucleic acid molecules
will be DNA.

B.1 Nucleic Acid Molecules Encoding Murine Dtk and Human Dtk
As indicated above, the nucleic acid sequences for murine Dtk and human Dtk 
have been determined. In specific aspects, the invention therefore provides 
nucleic acid molecules (in the form of DNA) as follows: 

1. A DNA molecule encoding murine Dtk having the nucleotide sequence 
given as SEQ ID NO 7. 

2. A DNA molecule encoding mature murine Dtk having the nucleotide 
sequence given as SEQ ID NO 8. 

3. A DNA molecule encoding the extracellular receptor domain of murine 
Dtk having the nucleotide sequence given as SEQ ID NO 11. 

4. A DNA molecule encoding human Dtk having the nucleotide sequence
given as SEQ ID NO 9. 



5. A DNA molecule encoding mature human Dtk having the nucleotide 
sequence given as SEQ ID NO 10. 

6. A DNA molecule encoding the extracellular receptor domain of human 
Dtk having the nucleotide sequence given as SEQ ID NO 12. 

The invention also includes within its scope functional equivalents of these DNA 
molecules. 

B.2 Nucleic Acid Molecules Encoding Dtks of other Mammals 
It will be appreciated that DNA molecules encoding the functional equivalent 
homologues of murine Dtk and human Dtk from other mammals are also within 
the scope of the invention. Such DNA molecules can be readily identified using 
conventional techniques and with reference to the information contained herein 
characterising murine Dtk and human Dtk. 



By way of generic illustration, DNA molecules encoding homologues of murine 
Dtk and human Dtk in other mammals can be identified by employing the 
following general steps: 

(a) Formation of a cDNA library:

Total mRNA from a suitable tissue source (such as brain) of the
mammal is prepared by standard procedures (Ausubel et al. (Eds),
"Current Protocols in Molecular Biology", Greene Publishing Associates/
Wiley-Interscience, New York (1990)), and cDNA synthesised. A cDNA
library is formed (for example in λ ZAP II).

(b) Library Screening:

The cDNA library formed as above is screened for the presence of
cDNA encoding homologues to murine Dtk and human Dtk.
Screening will generally employ a DNA hybridisation or
amplification step with the probes or primers being selected based
upon the already determined sequences of murine and human Dtk.

Most conveniently, the screening procedure will involve DNA
amplification using the polymerase chain reaction (PCR) (Saiki et al.,
Science 239, 487 (1988)) with the PCR primers being selected such
that highly conserved regions from within the DNA sequence of
murine and human Dtk will be within the amplified PCR product.

(c) DNA Isolation and Sequencing:

Clones from the cDNA library which are identified by screening step
(b) as containing cDNA encoding homologues to murine and human
Dtk are selected, and the size of the cDNA insert sourced from the
brain determined. Such clone(s) including a cDNA insert of the
appropriate size to code for the full-length Dtk are selected and the
cDNA insert isolated. Each isolated cDNA insert is then sequenced
using known procedures (for example, using the standard
dideoxy chain-termination method of Sanger et al., Proc. Natl. Acad.
Sci. USA 74, 5463-5467 (1977)).
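
The selection of highly conserved regions for primer placement in the screening step can be illustrated computationally. The sketch below assumes that the murine and human sequences have already been aligned to equal length by some other means; the window size, identity threshold and toy sequences are hypothetical choices made for illustration and do not come from this specification.

```python
def conserved_windows(seq_a, seq_b, window=20, min_identity=0.9):
    """Return (start, identity) for each window of two pre-aligned,
    equal-length sequences whose fractional identity meets the threshold.
    Start positions are 0-based; gap characters never count as matches."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to equal length")
    results = []
    for start in range(len(seq_a) - window + 1):
        a = seq_a[start:start + window]
        b = seq_b[start:start + window]
        matches = sum(1 for x, y in zip(a, b) if x == y and x != "-")
        identity = matches / window
        if identity >= min_identity:
            results.append((start, identity))
    return results


# Hypothetical toy alignment, not taken from Figures 6 or 7.
mouse_toy = "ATGGCCTTAGGACCTTGA"
human_toy = "ATGGCCTTAGCACGATGA"
starts = [s for s, _ in conserved_windows(mouse_toy, human_toy,
                                          window=8, min_identity=1.0)]
print(starts)
```

Windows reported by such a scan are candidate primer sites only; primer length, melting temperature and specificity would still need to be assessed by conventional means.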

B.3 Genetic Mapping of Murine Dtk and Human Dtk 

By way of further characterisation of both murine Dtk and human Dtk, the
applicants have performed experiments to establish the chromosomes on which the
genes coding for these Dtks are located. Details of these experiments are given
below.

Materials and Methods 

B.3.1 Fluorescent In Situ Hybridization (FISH)

A partial Sau3A genomic DNA library in λ2001, prepared from mouse ES cells
(Boehm et al., Proc. Natl. Acad. Sci. USA 88, 3927-3931 (1991)), was screened
with the 3.525 kb cDNA insert purified from pMo23A using methods previously
described (Morris et al., Blood 76, 1812-1818 (1991)). Of 34 positive clones, two
of the most intensely hybridizing, λMo23A-7.1 and λMo23A-8.1, were selected for
FISH studies. The pMo23A plasmid, and DNA isolated from bacteriophage
clones λMo23A-7.1 and λMo23A-8.1, were biotinylated by nick-translation using
biotin-14-dATP (Bethesda Research Laboratories, Gaithersburg, MD).
Karyotypically normal, 40,XY, mouse metaphase cells were prepared from ES
cells in culture using standard procedures. Fluorescent in situ hybridization and
detection procedures were essentially as described (Morris et al., Human Genetics
91, 31-36 (1993)), except that mouse Cot 1 DNA (BRL, final concentration
250 ng/µl) was used to suppress repetitive sequences in the two phage DNA
probes. Chromosomes were G-banded using DAPI (4',6-diamidino-2-
phenylindole dihydrochloride, Sigma, St Louis, MO) as a counterstain for
fluorescence analysis.

B.3.2 Single-strand conformation polymorphism (SSCP) 

Primer sequences from the 3' untranslated region of the Dtk cDNA used for
genetic mapping were as follows:

DtkMap1  5' TGGATGGCAGTAAGGGAGG 3'
         5' CTTAAGAGGGGCAAACCTGG 3'
DtkMap2  5' GCTTAGAGGAGGTGAGCCAGA 3'
         5' TGGGCAGTGCTGAGTTCC 3'
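
Basic properties of the mapping primers listed above can be tabulated with a short script. This is a sketch only: the "_a" and "_b" suffixes are arbitrary labels for the two primers of each pair, and the melting temperature estimate uses the Wallace rule of thumb (2°C per A or T, 4°C per G or C), which is an assumption introduced here and not a method described in this specification.

```python
# Primer sequences as listed in the text; "_a"/"_b" are arbitrary labels.
PRIMERS = {
    "DtkMap1_a": "TGGATGGCAGTAAGGGAGG",
    "DtkMap1_b": "CTTAAGAGGGGCAAACCTGG",
    "DtkMap2_a": "GCTTAGAGGAGGTGAGCCAGA",
    "DtkMap2_b": "TGGGCAGTGCTGAGTTCC",
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an unambiguous DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(seq):
    """Rule-of-thumb melting temperature in degrees C (an assumption)."""
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


for name, seq in PRIMERS.items():
    print(name, len(seq), round(gc_fraction(seq), 2), wallace_tm(seq))
```

The rule-of-thumb temperatures are rough guides only and are not intended to reproduce or justify the annealing temperature used in the experiments.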

PCR was performed using standard conditions with the addition of 32P-labelled
dCTP. Specifically, 25 µl reactions were performed in 10 mM Tris-HCl, 50 mM
KCl using 250 ng of genomic DNA, 1 µM of each primer, and 1.4 mM MgCl2.
This was overlaid with oil, denatured at 94°C for 5 minutes, and transferred to an
80°C heating block. dNTPs were added to a final concentration of 0.2 mM,
including 1.25 µCi of [α-32P]dCTP (1 µl of a 3000 Ci/mmole stock to 8 reactions).
1.25 units of AmpliTaq DNA polymerase (Perkin-Elmer Cetus) were added and
cycling conditions were as follows: 58°C annealing reaction for 1 minute, 72°C
extension reaction for 2 minutes, and 91°C denaturation for 1 minute. The cycle
was repeated 30 times with a final 72°C extension reaction for 5 minutes. SSCP
analysis was performed by electrophoresing the single-stranded PCR products on a
non-denaturing gel as follows: 2 µl of the PCR reaction was added to 8 µl of USB
stop solution (100% formamide containing xylene cyanol and bromophenol blue).
This was denatured for 5 minutes at 94°C and transferred to an ice bucket. 3 µl
was loaded on a 5% non-denaturing acrylamide gel containing 0.5X TBE and no
glycerol. This was run in a 4°C cold room in 0.5X TBE at 40 watts constant
power for 2-3 hours. The gel was transferred to filter paper, dried, and
autoradiographed overnight with an intensifying screen.
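
The cycling conditions and label dilution described above lend themselves to a simple worked calculation. The sketch below encodes the stated steps as data; the names are illustrative, the total excludes ramp times and the unspecified hold at 80°C, and the stock activity is inferred from the stated dilution rather than quoted from the text.

```python
# (step name, temperature in degrees C, duration in minutes), as stated above.
INITIAL_DENATURATION = ("denature", 94, 5.0)
CYCLE = [("anneal", 58, 1.0), ("extend", 72, 2.0), ("denature", 91, 1.0)]
N_CYCLES = 30
FINAL_EXTENSION = ("extend", 72, 5.0)


def programmed_minutes():
    """Total programmed time, ignoring ramping and the 80 degree C hold."""
    per_cycle = sum(minutes for _, _, minutes in CYCLE)
    return INITIAL_DENATURATION[2] + N_CYCLES * per_cycle + FINAL_EXTENSION[2]


def stock_activity_uci_per_ul(uci_per_reaction=1.25, reactions_per_ul=8):
    """Stock activity implied by sharing 1 microlitre across 8 reactions."""
    return uci_per_reaction * reactions_per_ul


print(programmed_minutes(), stock_activity_uci_per_ul())
```

On these figures the programmed time is a little over two hours, and the implied activity of the labelled dCTP stock is 10 µCi per µl.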

Results 

The chromosomal localisation of the gene encoding murine Dtk has been 
established on chromosome 2 band F using fluorescent in situ hybridisation. This 
result has been confirmed using single strand conformation polymorphism analysis 
in the BXD recombinant inbred series. 



The gene encoding human Dtk has been mapped using fluorescent in situ 
hybridisation to chromosome 15q15.

C. Recombinant Expression of Dtks of the Invention

In yet another aspect, the present invention relates to the recombinant expression 
of the Dtks or of their extracellular receptor domains. 

As will be exemplified below, the nucleic acid molecules that encode the receptors 
or the extracellular receptor domains of the invention may be inserted into known 
vectors for use in standard recombinant DNA techniques. Standard recombinant 
DNA techniques are those such as are described in Sambrook et al., "Molecular
Cloning", 2nd Edition, Cold Spring Harbour Laboratory Press (1987) and by
Ausubel et al., Eds, "Current Protocols in Molecular Biology", Greene Publishing
Associates and Wiley-Interscience, New York (1987).

Vectors for expressing proteins in bacteria, especially E. coli, are known. Such
vectors include the PATH vectors described by Dieckmann and Tzagoloff in J.
Biol. Chem. 260, 1513-1520 (1985). These vectors contain DNA sequences that
encode anthranilate synthetase (TrpE) followed by a polylinker at the carboxy
terminus. Other expression vector systems are based on beta-galactosidase
(pGEX); lambda PL; maltose binding protein (pMAL); and glutathione S-
transferase (pGST) - see Gene 67, 31 (1988) and Peptide Research 3, 167 (1990).

Vectors useful in yeast are available and well known. A suitable example is the
2µ plasmid.

Suitable vectors for use in mammalian cells are also known. Such vectors include 
well-known derivatives of SV-40, adenovirus, retrovirus-derived DNA sequences 
and vectors derived from combination of plasmids and phage DNA. 

Further eucaryotic expression vectors are known in the art (e.g. P.J. Southern and
P. Berg, J. Mol. Appl. Genet. 1, 327-341 (1982); S. Subramani et al., Mol. Cell.
Biol. 1, 854-864 (1981); R.J. Kaufmann and P.A. Sharp, "Amplification And
Expression of Sequences Cotransfected with a Modular Dihydrofolate Reductase
Complementary DNA Gene," J. Mol. Biol. 159, 601-621 (1982); R.J. Kaufmann
and P.A. Sharp, Mol. Cell. Biol. 159, 601-664 (1982); S.I. Scahill et al., "Expression
And Characterization Of The Product Of A Human Immune Interferon DNA
Gene In Chinese Hamster Ovary Cells," Proc. Natl. Acad. Sci. USA 80, 4654-4659
(1983); G. Urlaub and L.A. Chasin, Proc. Natl. Acad. Sci. USA 77, 4216-4220
(1980)).

The expression vectors useful in the present invention contain at least one
expression control sequence that is operatively linked to the DNA sequence or
fragment to be expressed. The control sequence is inserted in the vector in order
to control and to regulate the expression of the cloned DNA sequence. Examples
of useful expression control sequences are the lac system, the trp system, the tac
system, the trc system, major operator and promoter regions of phage lambda, the
control region of fd coat protein, the glycolytic promoters of yeast, e.g. the
promoter for 3-phosphoglycerate kinase, the promoters of yeast acid phosphatase,
e.g. Pho5, the promoters of the yeast alpha-mating factors, and promoters derived
from polyoma, adenovirus, retrovirus, and simian virus, e.g. the early and late
promoters of SV40, and other sequences known to control the expression of genes
of prokaryotic and eucaryotic cells and their viruses or combinations thereof.



Vectors containing the receptor-encoding DNA and control signals are inserted
into a host cell for expression of the receptor. Some useful expression host cells
include well-known prokaryotic and eucaryotic cells. Some suitable prokaryotic
hosts include, for example, E. coli, such as E. coli SG-936, E. coli HB 101, E. coli
W3110, E. coli X1776, E. coli X2282, E. coli DH1, and E. coli MRC1,
Pseudomonas, Bacillus, such as Bacillus subtilis, and Streptomyces. Suitable
eucaryotic cells include yeast and other fungi, insect, animal cells, such as COS
cells and CHO cells, human cells and plant cells in tissue culture.

A specific although non-limiting example of this aspect of the invention is set out
below. It will be appreciated that while the expression of murine Dtk is 
exemplified, the procedures disclosed are equally applicable to the expression of 
other Dtks, or to the expression of extracellular receptor domains of such Dtks. 

C.1 Expression of cloned murine Dtk in heterologous cell lines

The coding region of murine Dtk was ligated in-frame into the commercially
available expression vector pcDNA3 (Invitrogen) using standard molecular
biology techniques. The pcDNA3-Dtk construct was electroporated into several
heterologous cell lines to demonstrate expression of Dtk. Electroporation, drug
selection and isolation of Dtk-expressing clones for each cell line followed
standard techniques (see M. Kriegler, "Gene Transfer and Expression - A
Laboratory Manual", Stockton Press, New York (1990)).

The Dtk construct was expressed in the factor-dependent cell lines FDC-P1, 
BAF/3 and 32D, and in the NIH 3T3 cell line (all commercially available). The 
expression of Dtk in these cell lines has been ascertained at the level of RNA 
using standard techniques for the isolation of RNA and its detection using 
radiolabeled Dtk probes, which are familiar to those experienced in the field (see
Sambrook et al., "Molecular Cloning", Second Edition, supra).

D. Ligands 

The invention also includes ligands that bind to the Dtks of the invention. 




The ligand may be a protein such as a growth factor that occurs naturally in a 
mammal, preferably the same mammal that produces the corresponding Dtk. The 
growth factor may be isolated and purified, or be present on the surface of an 
isolated population of cells, such as stromal cells. 

The ligand may also be a molecule that does not occur naturally in a mammal. 
For example, antibodies raised against the receptors of the invention or against 
anti-ligand antibodies mimic the shape of, and act as, ligands if they constitute the 
negative image of the receptor or anti-ligand antibody binding site. 

Such antibodies may be polyclonal but are preferably monoclonal. Monoclonal 
antibodies may be produced by methods known in the art. These methods include 
the immunological method described by Kohler and Milstein in Nature 256, 495-497
(1975) and Campbell in "Monoclonal Antibody Technology, the Production
and Characterization of Rodent and Human Hybridomas" in Burdon et al., Eds,
Laboratory Techniques in Biochemistry and Molecular Biology, Volume 13,
Elsevier Science Publishers, Amsterdam (1985); as well as by the recombinant
DNA method described by Huse et al. in Science 246, 1275-1281 (1989).

In yet another form, the ligand may also be a non-protein molecule that acts as a 
ligand when it binds to, or otherwise comes into contact with, the receptor. 

In addition, ligands may be of two functional types. The first functional type of 
ligand is a molecule which binds to the receptor and stimulates it in performing its 
normal function (a "stimulant ligand"). The second functional type of ligand is a 
molecule which binds to the receptor and inhibits or prevents it performing its
normal function (an "antagonistic ligand"). 

Both types of ligand will find application in either therapeutic or prophylactic 
treatments as described below. 

D.1 Sources of Ligands

The strategy for isolating a ligand for the Dtks of the invention is based on the 
assumption that the ligand will either be a soluble, secreted protein or 
alternatively it will be membrane-bound or associated. 

To screen for soluble ligands, conditioned media from a range of tumor cell lines 
and tissues can be used. Such cell lines are readily available from the American 
Type Culture Collection (ATCC), Rockville, Maryland, USA. Conditioned medium
is generated from these cell lines using a variety of culture and induction
protocols. The cell lines are grown using standard tissue culture techniques which 
are detailed by ATCC for each cell line. Conditioned medium from tissues is 
generated by growing minced tissue fragments in culture medium for a defined 
time period. 

To screen for membrane-associated ligands a different approach is taken. Cell
lines derived from tissues which are in close proximity to those cells or tissues
which have been shown to express the Dtk receptor are used. This approach is
based on the likelihood of close cell-to-cell contact between receptor-expressing
cells and ligand-expressing cells. An example of this is in the testis, where Sertoli
cells express the receptor, while germ cells are considered a likely source of
membrane-bound ligand. A further example in the brain would be where one
type of neuron expresses the receptor, while microglial cells or another non-
neuronal brain cell are considered likely to express the ligand.

D.2 Ligand Screening Procedures 

In illustrating the screening procedures, reference will be made to murine Dtk as 
representative of the Dtks of the invention. Equivalent procedures can of course 
be employed in screening protocols using other mammalian Dtks such as human 
Dtk or the extracellular receptor domains of such Dtks. 

Two approaches are followed to screen for the ligand for murine Dtk. If the 
ligand is soluble, assays which utilise either growth responses or changes in 
tyrosine phosphorylation will be used. Alternatively, if the ligand is membrane- 
bound, ligand-expressing cells will be detected using a Dtk-tag protein system 
whereby the extracellular domain of Dtk is fused with sequence encoding part of 
the human immunoglobulin molecule, such as the Fc region or the µ chain. The
tag can then be detected using reagents which bind to the tag, such as Protein A-
alkaline phosphatase or 125I-labelled Protein A.

D.3 Soluble ligand 

To detect soluble ligand in the media conditioned by tumor cell lines or tissues, a
range of concentrations of these media is added to one of the factor-dependent
cell lines described above that has been transfected with, and expresses, the Dtk
receptor. These cell lines are routinely maintained in interleukin-3-containing
tissue culture medium. By withdrawing this medium and adding sources of
potential ligand for Dtk, a growth response will be sought that is mediated via the
introduced Dtk receptor. This response can be detected by measuring the uptake
of radiolabeled thymidine by liquid scintillation spectroscopy. These techniques
are standard for those skilled in the art (see Kriegler (supra); and Crosier et al.,
Proc. Natl. Acad. Sci. USA 88, 7744-7748 (1991)).
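
One common way of summarising thymidine uptake data is as a ratio of counts in test wells to counts in wells without added factor. The sketch below illustrates that calculation; the stimulation index is an assumption introduced here for illustration and is not a measure named in this specification, and the counts shown are hypothetical.

```python
from statistics import mean


def stimulation_index(test_cpm, background_cpm):
    """Ratio of mean counts per minute in test wells to mean counts in
    wells receiving no factor. A summary statistic assumed for illustration."""
    background = mean(background_cpm)
    if background <= 0:
        raise ValueError("background counts must be positive")
    return mean(test_cpm) / background


# Hypothetical replicate counts per minute, for illustration only.
no_factor = [400, 500, 600]
conditioned = [1400, 1500, 1600]
si = stimulation_index(conditioned, no_factor)
print(si)
```

A parallel comparison with untransfected parental cells would help to distinguish a response mediated by the introduced Dtk receptor from one mediated by endogenous receptors.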

An alternative detection system for ligands contained in tumor cell line 
conditioned medium uses the Dtk-expressing NIH 3T3 cell line as an indicator 
system, in conjunction with monitoring alterations in tyrosine phosphorylation of 
the Dtk receptor. Conditioned medium that contains the ligand for Dtk will 
trigger activation of the receptor which in turn is reflected in the phosphorylation 
status of the receptor. The system uses standard techniques whereby the NIH 3T3
cells are incubated with conditioned medium and cell lysates are produced, which
in turn are immunoprecipitated with an anti-murine Dtk polyclonal antibody. The
proteins are resolved on SDS-PAGE gels, followed by transfer to nitrocellulose
filters, subsequent Western blotting with an anti-phosphotyrosine antibody and
detection using enhanced chemiluminescence techniques. These techniques are standard
protein biochemistry methods (see B. Sefton and T. Hunter (eds), "Methods in 
Enzymology," vol 200 and 201, 1990; and Amersham, Manufacturer's protocols for 
ECL techniques). The expected result with this technique would be that potential 
ligand-containing media would stimulate increased tyrosine phosphorylation, 
compared with background levels detected in these cells. 

D.4 Membrane-bound ligand

Screening for membrane-bound or associated ligands for the Dtk receptor relies 
on the use of a Dtk-tag fusion protein detection system. The extracellular domain 
of the Dtk receptor is fused in-frame to the Fc region of human immunoglobulin
(IgG) or to part of the human µ chain of IgM. This procedure follows that
described by Goodwin et al., Cell 73, 447-456 (1993). The fusion protein is
produced by transfecting the fused genes contained within the pED5 c expression
vector into COS cells. The fusion protein is purified on Protein A-Sepharose
columns (Pharmacia). The Dtk-tag fusion proteins are biotinylated using
sulfosuccinimidyl-6-(biotinamido)-hexanoate (Pierce Chemicals) according to the
manufacturer's procedures. Alternatively, FITC-conjugated Dtk-tag fusion protein
is generated by conjugating the fusion protein to FITC using standard techniques
(see Suda et al., Cell 75, 1169-1178 (1993)).

The Dtk-tag fusion protein is used to screen for the expression of bound Dtk 
protein on tumor cell lines using flow cytometric techniques. The techniques used 
for the labelling of cells and flow cytometric analysis follow those described by 
Mosley et al., Cell 59, 335-348 (1989). Tumor cells are labelled on ice with the
biotinylated Dtk-fusion protein using avidin-FITC, or the FITC-labelled protein is
used directly in FACS analysis. The screening procedure is aimed at detecting a
cell line that produces a signal above background with the Dtk-tag fusion protein,
compared with an unrelated receptor-tag fusion protein. Sequential FACS sorting
of Dtk ligand-expressing cells is undertaken to generate a high Dtk ligand-
expressing tumor cell subline which can be used for the generation of a cDNA
expression library (for an overview of this strategy see Wong in Genetic
Engineering Vol. 12, ed. by J.K. Setlow (1990)).



E. Expression Cloning of the Dtk Ligand
E.1 Construction of an expression library

A random-primed expression library is constructed from poly(A)+ mRNA isolated
from the cell line or tissue demonstrated to give a positive signal in either the
growth assay, phosphorylation assay or Dtk-tag fusion protein assay outlined
above. The techniques used for construction of the expression library are standard
procedures for those experienced in the field (see McMahon et al., EMBO J. 10,
2821-2832 (1991); and Kriegler (supra)).

E.2 Cloning of the murine or human Dtk ligand 

The expression library constructed from the cell line or tissue is screened by
transfecting pools of cDNAs into COS cells using standard techniques (see
Sambrook et al., supra). Two approaches are used to detect positive pools,
depending on whether there has been evidence for either a soluble form of ligand
or a membrane-bound form of ligand.



Soluble forms: COS supernatants are screened in the detection systems outlined
above for soluble ligand forms. COS cells are grown in 10 cm plates using
standard tissue culture techniques.

Membrane-bound forms: COS cells are grown in Lab-Tek (Nunc) chambers and
positive pools are detected by using the binding of Dtk-tag fusion protein to the
COS cells, followed by detection with either a Protein A-horseradish peroxidase
enzymatic reaction or Protein A-125I binding and subsequent autoradiography.

Procedures for the breaking down of cDNA pools, subsequent sib selection and 
the isolation of single cDNA clones are outlined in Sambrook et al., (supra) and 
Wong (supra). Sequence analysis of single cDNAs follows standard techniques. 
Once a single cDNA clone is isolated this is transfected into COS cells or into 
CHO cells for large scale production of protein using standard procedures. 
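
The logic of breaking down a positive cDNA pool by sib selection can be sketched as a repeated subdivision. The example below is a simplified model under stated assumptions: the number of sub-pools per round, the clone names and the assay function are all hypothetical, a single positive clone is assumed, and the assay is treated as perfectly reliable, which a real binding or growth assay would not be.

```python
def sib_select(pool, is_positive, n_subpools=4):
    """Split a positive pool into sub-pools, keep the first sub-pool that
    scores positive, and repeat until one clone remains.
    Returns (clone, number_of_rounds)."""
    rounds = 0
    while len(pool) > 1:
        size = -(-len(pool) // n_subpools)  # ceiling division
        subpools = [pool[i:i + size] for i in range(0, len(pool), size)]
        positive = next((sp for sp in subpools if is_positive(sp)), None)
        if positive is None:
            raise ValueError("no positive sub-pool found")
        pool = positive
        rounds += 1
    return pool[0], rounds


# Hypothetical pool of 1000 clones containing one ligand-encoding clone.
clones = [f"clone_{i}" for i in range(1000)]
target = "clone_637"
clone, rounds = sib_select(clones, lambda subpool: target in subpool)
print(clone, rounds)
```

Because each round divides the pool by roughly the number of sub-pools, the number of transfection rounds grows only logarithmically with the starting pool size.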

F. Application of Ligands for the Dtks of the Invention

The types of ligand discussed above can be employed in two distinct methods in
accordance with this invention.

The first such method is a method of stimulating the proliferation, differentiation 
and/or survival of a cell expressing a Dtk of the invention. This stimulation, 
which can occur in vivo or ex vivo, involves contacting the cell with an effective 
amount of the ligand. 

The ability of a ligand according to the invention to stimulate cells such as stem 
cells which express the Dtk of the invention has important therapeutic 
applications. Such applications include medically treating mammals, including 
humans, whose stem cells do not sufficiently undergo self-renewal. Examples of 
such medical problems which can be treated in this way include those that occur 
when defects in haematopoietic stem cells or their related growth factors depress 
the number of blood cells, leading to disorders such as aplastic anaemia. The 
treatment of bone marrow damage resulting from cancer chemotherapy and
radiation is another example of a medical problem that could be treated in this 
way. 

The method can also be applied in stimulating the proliferation, differentiation 
and/or survival of mammalian fetal or adult neuronal cells or cells that form part 
of the central nervous system. Again, this has important therapeutic applications. 
Such applications include treating mammals, including humans, for inherited or 
degenerative disorders of the central nervous system. An additional application is 
the treatment of individuals with central nervous system trauma, for example, 
spinal cord trauma resulting from either crushing or asphyxia. 

Yet a further therapeutic application for the ligands of the invention is in sports 
medicine, particularly in the treatment of muscle injuries. The Dtk of the 
invention is abundantly expressed on myoblast cells but not on mature muscle 
cells. Application of the ligand will stimulate myoblast cell proliferation and 
differentiation, leading to muscle repair. 

In terms of ex vivo applications, the method has implications for gene therapy. In 
gene therapy, genes are inserted into host cells (such as haematopoietic stem cells 
and myoblasts) and the expression of the gene is regulated by either an endogenous 
or an exogenous promoter. However, it is often difficult to maintain growth and 
survival of these cells ex vivo while they are being manipulated for the insertion of 
foreign genes. Therefore, as the Dtk of the invention is expressed on 
haematopoietic stem cells and myoblasts, the ligand has a direct application in 
stimulating the growth, proliferation or simple survival of these cells during the 
manipulative process. 

The second distinct method of the invention is a method of inhibiting the function 
of the Dtk of the invention. This method, which would normally be applied in 
vivo for both prophylactic and therapeutic applications, involves contacting the 
receptor with a ligand which blocks or prevents stimulation of the receptor (an 
antagonist ligand). 

In terms of prophylaxis, such a method has specific application to the Sertoli cells 
of the testis, which abundantly express the receptor. Due to the involvement of 
these Sertoli cells in male fertility, contacting the receptors with an antagonistic 
ligand has a potential application in the control of male fertility including in male 
contraception. 

A potential therapeutic application of contacting cells expressing the Dtk of the 
invention with an antagonistic ligand is in anti-tumour therapy. This potential 
application arises from the growing understanding of the role sometimes played by 
RTKs in tumour formation. 

G Therapeutic Applications of Soluble Receptors 

The extracellular receptor domains of the invention as described above also have 
potential therapeutic applications. Such applications are in a method of treating a 
disease, syndrome or condition caused or mediated by an excess of a ligand of the 
invention (whether stimulant or antagonistic). 

In this method, the extracellular receptor domain of the Dtk in a soluble form can 
be used as a molecular "sponge" or "sink" to remove the excess of the ligand or at 
least to block its activity. 



H Functional Equivalents 

The invention includes functional equivalents of the Dtks, receptor domains, 
nucleic acid molecules and ligands described above. 

The Dtks, extracellular receptor domains and ligands are or include proteins. A 
protein is considered a functional equivalent of another protein for a specific 
function if the equivalent protein is immunologically cross-reactive with, and has 
the same function as, the original protein. The equivalent may, for example, be a 
fragment of the protein, or a substitution, addition or deletion mutant of the 
protein. 

For example, it is possible to substitute amino acids in a sequence with equivalent 
amino acids using conventional techniques. Groups of amino acids known 
normally to be equivalent are: 

(a) Ala(A) Ser(S) Thr(T) Pro(P) Gly(G); 

(b) Asn(N) Asp(D) Glu(E) Gln(Q); 

(c) His(H) Arg(R) Lys(K); 

(d) Met(M) Leu(L) Ile(I) Val(V); and 

(e) Phe(F) Tyr(Y) Trp(W). 

Substitutions, additions and/or deletions in the receptors and ligands may be made 
as long as the resulting equivalent receptors and ligands are immunologically 
cross-reactive with, and have the same function as, the native receptors and 
ligands. 
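
The equivalence groups (a) to (e) listed above can be expressed as a simple lookup. The following Python sketch is illustrative only and is not part of the original disclosure: the names `EQUIVALENCE_GROUPS`, `GROUP_INDEX` and `is_equivalent_substitution` are hypothetical. It checks group membership only; immunological cross-reactivity and retention of function still have to be established experimentally.

```python
# Equivalence groups (a) to (e), transcribed from the list in the text.
EQUIVALENCE_GROUPS = ("ASTPG", "NDEQ", "HRK", "MLIV", "FYW")

# Map each one-letter residue code to the index of its group.
GROUP_INDEX = {
    residue: index
    for index, group in enumerate(EQUIVALENCE_GROUPS)
    for residue in group
}


def is_equivalent_substitution(original: str, replacement: str) -> bool:
    """Return True if both residues belong to the same listed group.

    Hypothetical helper: a residue outside the five groups (for example
    Cys) is never treated as equivalent to a different residue.
    """
    a, b = original.upper(), replacement.upper()
    if a == b:
        return True
    return a in GROUP_INDEX and GROUP_INDEX.get(a) == GROUP_INDEX.get(b)


print(is_equivalent_substitution("L", "I"))  # both in group (d)
print(is_equivalent_substitution("D", "K"))  # group (b) versus group (c)
```

A substitution that passes this check is only a candidate equivalent under the grouping given in the text.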

The equivalent receptors and ligands will normally have substantially the same 
amino acid sequence as the native receptors and ligands. An amino acid sequence 
that is substantially the same as another sequence, but that differs from the other 
sequence by means of one or more substitutions, additions and/or deletions is 
considered to be an equivalent sequence. Preferably, less than 25%, more 
preferably less than 10%, and most preferably less than 5% of the number of 
amino acid residues in the amino acid sequence of the native receptors and 
ligands are substituted for, added to, or deleted from. 
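
The percentage thresholds above can be illustrated with a short calculation. The Python sketch below is an assumption-laden example rather than a method taken from the specification: it assumes the native and variant sequences have already been aligned to equal length with `-` marking gaps, and the function names `fraction_changed` and `preference_tier` are hypothetical.

```python
from typing import Optional


def fraction_changed(native_aligned: str, variant_aligned: str) -> float:
    """Fraction of native residues substituted, added to, or deleted.

    Assumes both strings are pre-aligned, equal in length, and use '-'
    for gaps. Every mismatching column counts as one change.
    """
    if len(native_aligned) != len(variant_aligned):
        raise ValueError("aligned sequences must have equal length")
    native_length = sum(1 for residue in native_aligned if residue != "-")
    if native_length == 0:
        raise ValueError("native sequence is empty")
    changes = sum(1 for n, v in zip(native_aligned, variant_aligned) if n != v)
    return changes / native_length


def preference_tier(fraction: float) -> Optional[str]:
    """Map a changed fraction onto the tiers named in the text."""
    if fraction < 0.05:
        return "most preferred"
    if fraction < 0.10:
        return "more preferred"
    if fraction < 0.25:
        return "preferred"
    return None


# First 20 residues of SEQ ID NO. 4 in one-letter code, with one
# illustrative Leu -> Met substitution at position 12.
native = "AGLKLMGAPVKLTVSQGQPV"
variant = "AGLKLMGAPVKMTVSQGQPV"
print(fraction_changed(native, variant), preference_tier(fraction_changed(native, variant)))
```

How changes are counted over a real alignment (for example, whether a multi-residue insertion counts once or per residue) is a design choice the text leaves open; this sketch counts per aligned column.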

Equivalent nucleic acid molecules include nucleic acid sequences that encode 
equivalent receptors and ligands as defined above. Equivalent nucleic acid 
molecules also include nucleic acid sequences that, due to the degeneracy of the 
genetic code, differ from native nucleic acid sequences in ways that do not 
affect the corresponding amino acid sequences. 
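
Degeneracy can be shown concretely by translating two different nucleotide sequences to the same peptide. The Python sketch below is illustrative only: it uses the standard genetic code, the first seven codons of the coding region of SEQ ID NO. 7 as the native example, and a hypothetical variant built from synonymous codons. The names `CODON_TABLE` and `translate` are assumptions introduced for this example.

```python
from itertools import product

# Standard genetic code, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string to one-letter amino acid codes."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# ATG GGG TGG CCG GGG CTC CGG: Met Gly Trp Pro Gly Leu Arg (SEQ ID NO. 7).
native = "ATGGGGTGGCCGGGGCTCCGG"
# Hypothetical variant using synonymous codons for Gly, Pro, Leu and Arg.
variant = "ATGGGATGGCCAGGTTTGAGA"

print(translate(native))
print(translate(variant))
```

The two nucleotide sequences differ at several positions, yet both encode the same seven residues, which is the sense in which such sequences are equivalent.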

Those persons skilled in the art will of course appreciate that the above 
description is provided by way of example only and that the invention is limited 
only by the lawful scope of the appended claims. 

(3) INFORMATION FOR SEQUENCE ID NO. 2: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 850 AMINO ACIDS 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(2) MOLECULE TYPE: PROTEIN 

(3) SEQUENCE DESCRIPTION: SEQ ID NO. 2: 
ne 


Leu 


Lgu 


Arg 


T itr 

i»ys 


Arg 


Arg 


Lys 


Glu 


Thr 


418 


Arq 


Phe 


Glv 


Gin 


Ala 


Phe 


Asp 


Ser 


Val 


Met 


Ala 


Ajrg 


P 1 tr 

b-iy 


431 


Glu 


Pro 


Ala 


Val 


His 


Phe 


Arg 


Ala 


Ala 


Arg 


Ser 


Phes 
rile 


Asn 


AAA 

444 


Arc 


Glu 


Arg 


Pro 


Glu 


Arg 


Hp 


nil) 


Ala 


Thr- 
ill! 


Lgu 


Asp 


Ser 


457 




rl v 
uj. y 


Tl 0 
lie 


Ocl 


Asp 


Tin 


Leu 


T up 


Pin 
GlU 


Lys 


Leu 


Glu 


Asp 


470 


V G. 1 


Lieu 


lie 


Pro 


Pin 
blU 


Pin 

bin 


pi n 
bin 


f ne 


Thr 


Leu 


Gly 


Arg 


Met 


483 


Leu 




Lys 


oxy 


fill 


rile 


PI u 

b j.y 


oer 


vai 


Arg 


Glu 


Ala 


Gin 


496 


Leu 


Ly s 




OX Li 


A en 


P,l v 


ocir 




V = 1 

vai 


Lys 


val 


Ala 


Val 


509 






Leu 


Lys 


Ala 


Asp 


Tl a 
Xi.6 


Tl a 

lie 


Aia 


Ser 


Ser 


Asp 


He 


522 


ft! U 


VjXU 


r^iie 


Leu 


Arg 


fin 


Aia 


Aia 


Cys 


Met 


Lys 


Glu 


Phe 


535 


Asp 


nlS 


Pro 


His 


val 


Ala 


Lys 


Leu 


Val 


Gly 


Val 


Ser 


Leu 


548 


Arg 


oer 


Arg 


Ala 


Lys 


Gly 


Arg 


Leu 


Pro 


He 


Pro 


Met 


Val 


561 


JL-L6 


Leu 


Pro 


Pne 


Met 


Lys 


His 


Gly 


Asp 


Leu 


His 


Ala 


Phe 


574 


Leu 


Leu 


Aid 


C r\ v 

5er 


Arg 


lie 


Gly 


Glu 


Asn 


Pro 


Phe 


Asn 


Leu 


587 


Pro 


Leu 


Gin 


Thr 


L&u 




Arg 


Dries 

rule 


MrtT- 

Ket 


vai 


Asp 


He 


Ala 


600 


Cys 


Glv 


1 1C L> 


r;l 11 

OX Li 


iyr 


Leu 


Cor 

oer 




Arg 


Asn 


Phe 


He 


His 


613 


rtJ. y 


Asp 


Leu 


Aia 


Aia 


Arg 


Asn 


Cys 


Met 


Leu 


Ala 


Glu 


Asp 


626 


net 


inr 


vai 


Cys 


vai 


Ala 


Asp 


Phe 


Gly 


Leu 


Ser 


Arg 


Lys 


639 


x xe 


Tyr 


ber 


Gly 


Asp 


Tyr 


Tyr 


Arg 


Gin 


Gly 


Cys 


Ala 


Ser 


652 


Lys 


Leu 


Pro 


vai 


Lys 


Trp 


Leu 


Ala 


Leu 


Glu 


Ser 


Leu 


Ala 


665 


Asp 


Asn 


Leu 


Tyr 


lnr 


vai 


His 


Ser 


Asp 


Val 


Trp 


Ala 


Phe 


678 


Glv 


Val 


Ji ilX 


Tic L. 


irp 


OlU 


Tl a 

jl xe 


net 


inr 


Arg 


Gly 


Gin 


Thr 


691 


Pro 


lyr 


Al a 
nla 


oxy 


Tl a 
± ic 


pi »i 

ulU 


Asn 


Aia 


pi «i 
Glu 


Tl - 

lie 


Tyr 


Asn 


Tyr 


704 


Leu 


lie 


OX jr 


oxy 


noil 


Arg 


Leu 


Lys 


Pi 

bin 


Pro 


Pro 


Glu 


Cys 


717 


Met 


Ul Li 


ulU 


Vol 


Tyr 


Asp 


Leu 


net 


Tyr 


Gin 


Cys 


Trp 


Ser 


730 


Ala 


Asp 


Pro 


Lys 


bin 


Arg 


Pro 


Ser 


Pne 


Thr 


Cys 


Leu 


Arg 


743 


Meal- ' 


Pin 
uiU 


Leu 


Pi »i 
olU 


Asn 


Tl - 

lie 


Leu 


p 1 >• 

Gly 


His 


Leu 


Ser 


Val 


Leu 


756 


Ser 


Thr 
mi. 


OCX 


m n 

0111 


Asp 


Pro 


Leu 


Tyr 


lie 


,Asn 


He 


Glu 


Arg 


769 


Ala 


uiu 


bin 


Pro 


Thr- 

inr 


bill 


oer 


pi • » 

uly 


Ser 


Pro 


Glu 


Leu 


His 


782 




OX jr 


OX Li 


ni. y 




del 


Car 
Del 


Tin 

blU 


Ala 


Gly 


Asp 


Gly 


Ser 


795 


Gly Val Gly Ala Val Gly Gly Ile Pro Ser Asp Ser Arg 808 
Tyr Ile Phe Ser Pro Gly Gly Leu Ser Glu Ser Pro Gly 821 
Gln Leu Glu Gln Gln Pro Glu Ser Pro Leu Asn Glu Asn 834 
Gln Arg Leu Leu Leu Leu Gln Gln Gly Leu Leu Pro His 847 



WO 94/19463 



-47- 



PCT/NZ94/00009 



Ser Ser Cys 850 

(4) INFORMATION FOR SEQUENCE ID NO. 3: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 876 AMINO ACIDS 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(2) MOLECULE TYPE: PROTEIN 

(3) SEQUENCE DESCRIPTION: SEQ ID NO. 3: 



Met Gly Arg Pro Gly Leu Pro Pro Leu Pro Leu Pro Pro 13 
Pro Pro Arg Leu Gly Leu Leu Leu Ala Glu Ser Ala Ala 26 
Ala Gly Leu Lys Leu Met Gly Ala Pro Val Lys Leu Thr 39 
Val Ser Gln Gly Gln Pro Val Lys Leu Asn Cys Ser Val 52 
Glu Gly Met Glu Glu Pro Asp Ile Gln Trp Val Lys Asp 65 
Gly Ala Val Val Gln Asn Leu Asp Gln Leu Tyr Ile Pro 78 
Val Ser Glu Gln His Trp Ile Gly Phe Leu Ser Leu Lys 91 
Ser Val Glu Arg Ser Asp Ala Gly Arg Tyr Trp Cys Gln 104 
Val Glu Asp Gly Gly Glu Thr Glu Ile Ser Gln Pro Val 117 
Trp Leu Thr Val Glu Gly Val Pro Phe Phe Thr Val Glu 130 
Pro Lys Asp Leu Ala Val Pro Pro Asn Ala Pro Phe Gln 143 
Leu Ser Cys Glu Ala Val Gly Pro Pro Glu Pro Val Thr 156 
Ile Val Trp Trp Arg Gly Thr Thr Lys Ile Gly Gly Pro 169 
Ala Pro Ser Pro Ser Val Leu Asn Val Thr Gly Val Thr 182 
Gln Ser Thr Met Phe Ser Cys Glu Ala His Asn Leu Lys 195 
Gly Leu Ala Ser Ser Arg Thr Ala Thr Val His Leu Gln 208 
Ala Leu Pro Ala Ala Pro Phe Asn Ile Thr Val Thr Lys 221 
Leu Ser Ser Ser Asn Ala Ser Val Ala Trp Met Pro Gly 234 
Ala Asp Gly Arg Ala Leu Leu Gln Ser Cys Thr Val Gln 247 
Val Thr Gln Ala Pro Gly Gly Trp Glu Val Leu Ala Val 260 
Val Val Pro Val Pro Pro Phe Thr Cys Leu Leu Arg Asp 273 
Leu Val Pro Ala Thr Asn Tyr Ser Leu Arg Val Arg Cys 286 
Ala Asn Ala Leu Gly Pro Ser Pro Tyr Ala Asp Trp Val 299 
Pro Phe Gln Thr Lys Gly Leu Ala Pro Ala Ser Ala Pro 312 
Gln Asn Leu His Ala Ile Arg Thr Asp Ser Gly Leu Ile 325 
Leu Glu Trp Glu Glu Val Ile Pro Glu Ala Pro Leu Glu 338 
Gly Pro Leu Gly Pro Tyr Lys Leu Ser Trp Val Gln Asp 351 
Asn Gly Thr Gln Asp Glu Leu Thr Val Glu Gly Thr Arg 364 
Ala Asn Leu Thr Gly Trp Asp Pro Gln Lys Asp Leu Ile 377 
Val Arg Val Cys Val Ser Asn Ala Val Gly Cys Gly Pro 390 
Trp Ser Gln Pro Leu Val Val Ser Ser His Asp Arg Ala 403 
Gly Gln Gln Gly Pro Pro His Ser Arg Thr Ser Trp Val 416 
Pro Val Val Leu Gly Val Leu Thr Ala Leu Val Thr Ala 429 
Ala Ala Leu Ala Leu Ile Leu Leu Arg Lys Arg Arg Lys 442 
Glu Thr Arg Phe Gly Gln Ala Phe Asp Ser Val Met Ala 455 
Arg Gly Glu Pro Ala Val His Phe Arg Ala Ala Arg Ser 468 
Phe Asn Arg Glu Arg Pro Glu Arg Ile Glu Ala Thr Leu 481 
Asp Ser Leu Gly Ile Ser Asp Glu Leu Lys Glu Lys Leu 494 
Glu Asp Val Leu Ile Pro Glu Gln Gln Phe Thr Leu Gly 507 
Arg Met Leu Gly Lys Gly Glu Phe Gly Ser Val Arg Glu 520 
Ala Gln Leu Lys Gln Glu Asp Gly Ser Phe Val Lys Val 533 
Ala Val Lys Met Leu Lys Ala Asp Ile Ile Ala Ser Ser 546 
Asp Ile Glu Glu Phe Leu Arg Glu Ala Ala Cys Met Lys 559 
Glu Phe Asp His Pro His Val Ala Lys Leu Val Gly Val 572 
Ser Leu Arg Ser Arg Ala Lys Gly Arg Leu Pro Ile Pro 585 
Met Val Ile Leu Pro Phe Met Lys His Gly Asp Leu His 598 
Ala Phe Leu Leu Ala Ser Arg Ile Gly Glu Asn Pro Phe 611 
Asn Leu Pro Leu Gln Thr Leu Ile Arg Phe Met Val Asp 624 
Ile Ala Cys Gly Met Glu Tyr Leu Ser Ser Arg Asn Phe 637 
Ile His Arg Asp Leu Ala Ala Arg Asn Cys Met Leu Ala 650 
Glu Asp Met Thr Val Cys Val Ala Asp Phe Gly Leu Ser 663 
Arg Lys Ile Tyr Ser Gly Asp Tyr Tyr Arg Gln Gly Cys 676 
Ala Ser Lys Leu Pro Val Lys Trp Leu Ala Leu Glu Ser 689 
Leu Ala Asp Asn Leu Tyr Thr Val Gln Ser Asp Val Trp 702 
Ala Phe Gly Val Thr Met Trp Glu Ile Met Thr Arg Gly 715 
Gln Thr Pro Tyr Ala Gly Ile Glu Asn Ala Glu Ile Tyr 728 
Asn Tyr Leu Ile Gly Gly Asn Arg Leu Lys Gln Pro Pro 741 
Glu Cys Met Glu Asp Val Tyr Asp Leu Met Tyr Gln Cys 754 
Trp Ser Ala Asp Pro Lys Gln Arg Pro Ser Phe Thr Cys 767 
Leu Arg Met Glu Leu Glu Asn Ile Leu Gly Gln Leu Ser 780 
Val Leu Ser Ala Ser Gln Asp Pro Leu Tyr Ile Asn Ile 793 
Glu Arg Ala Glu Glu Pro Thr Val Gly Gly Ser Leu Glu 806 
Leu Pro Gly Arg Asp Gln Pro Tyr Ser Gly Ala Gly Asp 819 
Gly Ser Gly Met Gly Ala Val Gly Gly Thr Pro Ser Asp 832 
Cys Arg Tyr Ile Leu Thr Pro Gly Gly Leu Ala Glu Gln 845 
Pro Gly Gln Ala Glu His Gln Pro Glu Ser Pro Leu Asn 858 
Glu Thr Gln Arg Leu Leu Leu Leu Gln Gln Gly Leu Leu 871 
Pro His Ser Ser Cys 876 



(5) INFORMATION FOR SEQUENCE ID NO. 4: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 850 AMINO ACIDS 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(2) MOLECULE TYPE: PROTEIN 



(3) SEQUENCE DESCRIPTION: SEQ ID NO. 4: 



Ala Gly Leu Lys Leu Met Gly Ala Pro Val Lys Leu Thr 13 
Val Ser Gln Gly Gln Pro Val Lys Leu Asn Cys Ser Val 26 
Glu Gly Met Glu Glu Pro Asp Ile Gln Trp Val Lys Asp 39 
Gly Ala Val Val Gln Asn Leu Asp Gln Leu Tyr Ile Pro 52 
Val Ser Glu Gln His Trp Ile Gly Phe Leu Ser Leu Lys 65 
Ser Val Glu Arg Ser Asp Ala Gly Arg Tyr Trp Cys Gln 78 
Val Glu Asp Gly Gly Glu Thr Glu Ile Ser Gln Pro Val 91 
Trp Leu Thr Val Glu Gly Val Pro Phe Phe Thr Val Glu 104 
Pro Lys Asp Leu Ala Val Pro Pro Asn Ala Pro Phe Gln 117 
Leu Ser Cys Glu Ala Val Gly Pro Pro Glu Pro Val Thr 130 
Ile Val Trp Trp Arg Gly Thr Thr Lys Ile Gly Gly Pro 143 
Ala Pro Ser Pro Ser Val Leu Asn Val Thr Gly Val Thr 156 
Gln Ser Thr Met Phe Ser Cys Glu Ala His Asn Leu Lys 169 
Gly Leu Ala Ser Ser Arg Thr Ala Thr Val His Leu Gln 182 
Ala Leu Pro Ala Ala Pro Phe Asn Ile Thr Val Thr Lys 195 
Leu Ser Ser Ser Asn Ala Ser Val Ala Trp Met Pro Gly 208 
Ala Asp Gly Arg Ala Leu Leu Gln Ser Cys Thr Val Gln 221 
Val Thr Gln Ala Pro Gly Gly Trp Glu Val Leu Ala Val 234 
Val Val Pro Val Pro Pro Phe Thr Cys Leu Leu Arg Asp 247 
Leu Val Pro Ala Thr Asn Tyr Ser Leu Arg Val Arg Cys 260 
Ala Asn Ala Leu Gly Pro Ser Pro Tyr Ala Asp Trp Val 273 
Pro Phe Gln Thr Lys Gly Leu Ala Pro Ala Ser Ala Pro 286 
Gln Asn Leu His Ala Ile Arg Thr Asp Ser Gly Leu Ile 299 
Leu Glu Trp Glu Glu Val Ile Pro Glu Ala Pro Leu Glu 312 
Gly Pro Leu Gly Pro Tyr Lys Leu Ser Trp Val Gln Asp 325 
Asn Gly Thr Gln Asp Glu Leu Thr Val Glu Gly Thr Arg 338 
Ala Asn Leu Thr Gly Trp Asp Pro Gln Lys Asp Leu Ile 351 
Val Arg Val Cys Val Ser Asn Ala Val Gly Cys Gly Pro 364 
Trp Ser Gln Pro Leu Val Val Ser Ser His Asp Arg Ala 377 
Gly Gln Gln Gly Pro Pro His Ser Arg Thr Ser Trp Val 390 
Pro Val Val Leu Gly Val Leu Thr Ala Leu Val Thr Ala 403 
Ala Ala Leu Ala Leu Ile Leu Leu Arg Lys Arg Arg Lys 416 
Glu Thr Arg Phe Gly Gln Ala Phe Asp Ser Val Met Ala 429 
Arg Gly Glu Pro Ala Val His Phe Arg Ala Ala Arg Ser 442 
Phe Asn Arg Glu Arg Pro Glu Arg Ile Glu Ala Thr Leu 455 
Asp Ser Leu Gly Ile Ser Asp Glu Leu Lys Glu Lys Leu 468 
Glu Asp Val Leu Ile Pro Glu Gln Gln Phe Thr Leu Gly 481 
Arg Met Leu Gly Lys Gly Glu Phe Gly Ser Val Arg Glu 494 
Ala Gln Leu Lys Gln Glu Asp Gly Ser Phe Val Lys Val 507 
Ala Val Lys Met Leu Lys Ala Asp Ile Ile Ala Ser Ser 520 
Asp Ile Glu Glu Phe Leu Arg Glu Ala Ala Cys Met Lys 533 
Glu Phe Asp His Pro His Val Ala Lys Leu Val Gly Val 546 
Ser Leu Arg Ser Arg Ala Lys Gly Arg Leu Pro Ile Pro 559 
Met Val Ile Leu Pro Phe Met Lys His Gly Asp Leu His 572 
Ala Phe Leu Leu Ala Ser Arg Ile Gly Glu Asn Pro Phe 585 
Asn Leu Pro Leu Gln Thr Leu Ile Arg Phe Met Val Asp 598 
Ile Ala Cys Gly Met Glu Tyr Leu Ser Ser Arg Asn Phe 611 
Ile His Arg Asp Leu Ala Ala Arg Asn Cys Met Leu Ala 624 
Glu Asp Met Thr Val Cys Val Ala Asp Phe Gly Leu Ser 637 
Arg Lys Ile Tyr Ser Gly Asp Tyr Tyr Arg Gln Gly Cys 650 
Ala Ser Lys Leu Pro Val Lys Trp Leu Ala Leu Glu Ser 663 
Leu Ala Asp Asn Leu Tyr Thr Val Gln Ser Asp Val Trp 676 
Ala Phe Gly Val Thr Met Trp Glu Ile Met Thr Arg Gly 689 
Gln Thr Pro Tyr Ala Gly Ile Glu Asn Ala Glu Ile Tyr 702 
Asn Tyr Leu Ile Gly Gly Asn Arg Leu Lys Gln Pro Pro 715 
Glu Cys Met Glu Asp Val Tyr Asp Leu Met Tyr Gln Cys 728 
Trp Ser Ala Asp Pro Lys Gln Arg Pro Ser Phe Thr Cys 741 
Leu Arg Met Glu Leu Glu Asn Ile Leu Gly Gln Leu Ser 754 
Val Leu Ser Ala Ser Gln Asp Pro Leu Tyr Ile Asn Ile 767 
Glu Arg Ala Glu Glu Pro Thr Val Gly Gly Ser Leu Glu 780 
Leu Pro Gly Arg Asp Gln Pro Tyr Ser Gly Ala Gly Asp 793 
Gly Ser Gly Met Gly Ala Val Gly Gly Thr Pro Ser Asp 806 
Cys Arg Tyr Ile Leu Thr Pro Gly Gly Leu Ala Glu Gln 819 
Pro Gly Gln Ala Glu His Gln Pro Glu Ser Pro Leu Asn 832 
Glu Thr Gln Arg Leu Leu Leu Leu Gln Gln Gly Leu Leu 845 
Pro His Ser Ser Cys 850 



(6) INFORMATION FOR SEQUENCE ID NO. 5: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 386 AMINO ACIDS 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(2) MOLECULE TYPE: PROTEIN 

(3) SEQUENCE DESCRIPTION: SEQ ID NO. 5: 

Ala Gly 2 
Leu Lys Leu Met Gly Ala Pro Val Lys Met Thr Val Ser 15 
Gln Gly Gln Pro Val Lys Leu Asn Cys Ser Val Glu Gly 28 
Met Glu Asp Pro Asp Ile His Trp Met Lys Asp Gly Thr 41 
Val Val Gln Asn Ala Ser Gln Val Ser Ile Ser Ile Ser 54 
Glu His Ser Trp Ile Gly Leu Leu Ser Leu Lys Ser Val 67 
Glu Arg Ser Asp Ala Gly Leu Tyr Trp Cys Gln Val Lys 80 
Asp Gly Glu Glu Thr Lys Ile Ser Gln Ser Val Trp Leu 93 
Thr Val Glu Gly Val Pro Phe Phe Thr Val Glu Pro Lys 106 
Asp Leu Ala Val Pro Pro Asn Ala Pro Phe Gln Leu Ser 119 
Cys Glu Ala Val Gly Pro Pro Glu Pro Val Thr Ile Tyr 132 
Trp Trp Arg Gly Leu Thr Lys Val Gly Gly Pro Ala Pro 145 
Ser Pro Ser Val Leu Asn Val Thr Gly Val Thr Gln Arg 158 
Thr Glu Phe Ser Cys Glu Ala Arg Asn Ile Lys Gly Leu 171 
Ala Thr Ser Arg Pro Ala Ile Val Arg Leu Gln Ala Pro 184 
Pro Ala Ala Pro Phe Asn Thr Thr Val Thr Thr Ile Ser 197 
Ser Tyr Asn Ala Ser Val Ala Trp Val Pro Gly Ala Asp 210 
Gly Leu Ala Leu Leu His Ser Cys Thr Val Gln Val Ala 223 
His Ala Pro Gly Glu Trp Glu Ala Leu Ala Val Val Val 236 
Pro Val Pro Pro Phe Thr Cys Leu Leu Arg Asn Leu Ala 249 
Pro Ala Thr Asn Tyr Ser Leu Arg Val Arg Cys Ala Asn 262 
Ala Leu Gly Pro Ser Pro Tyr Gly Asp Trp Val Pro Phe 275 
Gln Thr Lys Gly Leu Ala Pro Ala Arg Ala Pro Gln Asn 288 
Phe His Ala Ile Arg Thr Asp Ser Gly Leu Ile Leu Glu 301 
Trp Glu Glu Val Ile Pro Glu Asp Pro Gly Glu Gly Pro 314 
Leu Gly Pro Tyr Lys Leu Ser Trp Val Gln Glu Asn Gly 327 
Thr Gln Asp Glu Leu Met Val Glu Gly Thr Arg Ala Asn 340 
Leu Thr Asp Trp Asp Pro Gln Lys Asp Leu Ile Leu Arg 353 
Val Cys Ala Ser Asn Ala Ile Gly Asp Gly Pro Trp Ser 366 
Gln Pro Leu Val Val Ser Ser His Asp His Ala Gly Arg 379 
Gln Gly Pro Pro His Ser Arg 386 



(7) INFORMATION FOR SEQUENCE ID NO. 6: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 386 AMINO ACIDS 

(B) TYPE: AMINO ACID 

(C) TOPOLOGY: LINEAR 

(2) MOLECULE TYPE: PROTEIN 

(3) SEQUENCE DESCRIPTION: SEQ ID NO. 6: 






Ala Gly Leu Lys Leu Met Gly Ala Pro Val Lys Leu Thr 13 
Val Ser Gln Gly Gln Pro Val Lys Leu Asn Cys Ser Val 26 
Glu Gly Met Glu Glu Pro Asp Ile Gln Trp Val Lys Asp 39 
Gly Ala Val Val Gln Asn Leu Asp Gln Leu Tyr Ile Pro 52 
Val Ser Glu Gln His Trp Ile Gly Phe Leu Ser Leu Lys 65 
Ser Val Glu Arg Ser Asp Ala Gly Arg Tyr Trp Cys Gln 78 
Val Glu Asp Gly Gly Glu Thr Glu Ile Ser Gln Pro Val 91 
Trp Leu Thr Val Glu Gly Val Pro Phe Phe Thr Val Glu 104 
Pro Lys Asp Leu Ala Val Pro Pro Asn Ala Pro Phe Gln 117 
Leu Ser Cys Glu Ala Val Gly Pro Pro Glu Pro Val Thr 130 
Ile Val Trp Trp Arg Gly Thr Thr Lys Ile Gly Gly Pro 143 
Ala Pro Ser Pro Ser Val Leu Asn Val Thr Gly Val Thr 156 
Gln Ser Thr Met Phe Ser Cys Glu Ala His Asn Leu Lys 169 
Gly Leu Ala Ser Ser Arg Thr Ala Thr Val His Leu Gln 182 
Ala Leu Pro Ala Ala Pro Phe Asn Ile Thr Val Thr Lys 195 
Leu Ser Ser Ser Asn Ala Ser Val Ala Trp Met Pro Gly 208 
Ala Asp Gly Arg Ala Leu Leu Gln Ser Cys Thr Val Gln 221 
Val Thr Gln Ala Pro Gly Gly Trp Glu Val Leu Ala Val 234 
Val Val Pro Val Pro Pro Phe Thr Cys Leu Leu Arg Asp 247 
Leu Val Pro Ala Thr Asn Tyr Ser Leu Arg Val Arg Cys 260 
Ala Asn Ala Leu Gly Pro Ser Pro Tyr Ala Asp Trp Val 273 
Pro Phe Gln Thr Lys Gly Leu Ala Pro Ala Ser Ala Pro 286 
Gln Asn Leu His Ala Ile Arg Thr Asp Ser Gly Leu Ile 299 
Leu Glu Trp Glu Glu Val Ile Pro Glu Ala Pro Leu Glu 312 
Gly Pro Leu Gly Pro Tyr Lys Leu Ser Trp Val Gln Asp 325 
Asn Gly Thr Gln Asp Glu Leu Thr Val Glu Gly Thr Arg 338 
Ala Asn Leu Thr Gly Trp Asp Pro Gln Lys Asp Leu Ile 351 
Val Arg Val Cys Val Ser Asn Ala Val Gly Cys Gly Pro 364 
Trp Ser Gln Pro Leu Val Val Ser Ser His Asp Arg Ala 377 
Gly Gln Gln Gly Pro Pro His Ser Arg 386 



(8) INFORMATION FOR SEQUENCE ID NO. 7: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3919 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(2) MOLECULE TYPE: cDNA 

(3) SEQUENCE DESCRIPTION: SEQ ID NO. 7: 



GGCACGAGTGTGGAAGGAGCGCGGTGGCCCAGCCGCAGCCCCGGGGACTCCTCGCTGCTG 60 

ACGGCGGTGGCCGCGGCTCTAGGCGGCCGCGGGTCCCGGACGCCCCGGCCGAGCGCCGCC 120 

CCCCGCCCCTCCCGCGGGCCTCCCGCCCCTCCTCCGCCACCCTCCTCTCAGCGCTCGCGG 180 

GCCGGGCCCGGCATGGTGCGGCGTCGCCGCCGATGGCGCTGAGGCGGAGC 230 

Met Gly Trp Pro Gly Leu Arg Pro Leu Leu Leu Ala Gly 
ATG GGG TGG CCG GGG CTC CGG CCG CTG CTG CTG GCG GGA 269 

Leu Ala Ser Leu Leu Leu Pro Gly Ser Ala Ala Ala Gly 
CTG GCT TCT CTG CTG CTC CCC GGG TCT GCG GCC GCA GGC 308 

Leu Lys Leu Met Gly Ala Pro Val Lys Met Thr Val Ser 
CTG AAG CTC ATG GGC GCC CCA GTG AAG ATG ACC GTG TCT 347 

Gln Gly Gln Pro Val Lys Leu Asn Cys Ser Val Glu Gly 
CAG GGG CAG CCA GTG AAG CTC AAC TGC AGC GTG GAG GGG 386 

Met Glu Asp Pro Asp Ile His Trp Met Lys Asp Gly Thr 
ATG GAG GAC CCT GAC ATC CAC TGG ATG AAG GAT GGC ACC 425 

Val Val Gln Asn Ala Ser Gln Val Ser Ile Ser Ile Ser 
GTG GTC CAG AAT GCA AGC CAG GTG TCC ATC TCC ATC AGC 464 

Glu His Ser Trp Ile Gly Leu Leu Ser Leu Lys Ser Val 
GAG CAC AGC TGG ATT GGC TTA CTC AGC CTA AAG TCA GTG 503 

Glu Arg Ser Asp Ala Gly Leu Tyr Trp Cys Gln Val Lys 
GAG CGG TCT GAT GCT GGC CTG TAC TGG TGC CAG GTG AAG 542 

Asp Gly Glu Glu Thr Lys Ile Ser Gln Ser Val Trp Leu 
GAT GGG GAG GAA ACC AAG ATC TCT CAG TCA GTA TGG CTC 581 

Thr Val Glu Gly Val Pro Phe Phe Thr Val Glu Pro Lys 
ACT GTC GAA GGT GTG CCA TTC TTC ACA GTG GAA CCA AAA 620 

Asp Leu Ala Val Pro Pro Asn Ala Pro Phe Gln Leu Ser 
GAT CTG GCG GTG CCA CCC AAT GCC CCT TTT CAG CTG TCT 659 

Cys Glu Ala Val Gly Pro Pro Glu Pro Val Thr Ile Tyr 
TGT GAG GCT GTG GGT CCT CCA GAA CCC GTA ACC ATT TAC 698 

Trp Trp Arg Gly Leu Thr Lys Val Gly Gly Pro Ala Pro 
TGG TGG AGA GGA CTC ACT AAA GTT GGG GGA CCT GCT CCC 737 

Ser Pro Ser Val Leu Asn Val Thr Gly Val Thr Gln Arg 
TCT CCC TCT GTT TTA AAT GTG ACA GGA GTG ACC CAG CGC 776 

Thr Glu Phe Ser Cys Glu Ala Arg Asn Ile Lys Gly Leu 
ACA GAG TTT TCT TGT GAA GCC CGC AAC ATA AAA GGC CTG 815 

Ala Thr Ser Arg Pro Ala Ile Val Arg Leu Gln Ala Pro 
GCC ACT TCC CGA CCA GCC ATT GTT CGC CTT CAA GCA CCG 854 

Pro Ala Ala Pro Phe Asn Thr Thr Val Thr Thr Ile Ser 
CCT GCA GCT CCT TTC AAC ACC ACA GTA ACA ACG ATC TCC 893 

Ser Tyr Asn Ala Ser Val Ala Trp Val Pro Gly Ala Asp 
AGC TAC AAC GCT AGC GTG GCC TGG GTG CCA GGT GCT GAC 932 

Gly Leu Ala Leu Leu His Ser Cys Thr Val Gln Val Ala 
GGC CTA GCT CTG CTG CAT TCC TGT ACT GTA CAG GTG GCA 971 

His Ala Pro Gly Glu Trp Glu Ala Leu Ala Val Val Val 
CAC GCC CCA GGA GAA TGG GAG GCC CTT GCT GTT GTG GTT 1010 

Pro Val Pro Pro Phe Thr Cys Leu Leu Arg Asn Leu Ala 
CCT GTG CCA CCT TTT ACC TGC CTG CTT CGG AAC TTG GCC 1049 

Pro Ala Thr Asn Tyr Ser Leu Arg Val Arg Cys Ala Asn 
CCT GCC ACC AAC TAC AGC CTT AGG GTG CGC TGT GCC AAT 1088 

Ala Leu Gly Pro Ser Pro Tyr Gly Asp Trp Val Pro Phe 
GCC TTG GGC CCT TCT CCC TAC GGC GAC TGG GTG CCC TTT 1127 

Gln Thr Lys Gly Leu Ala Pro Ala Arg Ala Pro Gln Asn 
CAG ACA AAG GGC CTA GCG CCA GCC AGA GCT CCT CAG AAT 1166 
Phe His Ala Ile Arg Thr Asp Ser Gly Leu Ile Leu Glu
TTC CAT GCC ATT CGT ACC GAC TCA GGC CTT ATC CTG GAA 1205

Trp Glu Glu Val Ile Pro Glu Asp Pro Gly Glu Gly Pro
TGG GAA GAA GTG ATT CCT GAG GAC CCT GGG GAA GGC CCC 1244

Leu Gly Pro Tyr Lys Leu Ser Trp Val Gln Glu Asn Gly
CTA GGA CCT TAT AAG CTG TCC TGG GTC CAA GAA AAT GGA 1283

Thr Gln Asp Glu Leu Met Val Glu Gly Thr Arg Ala Asn
ACC CAG GAT GAG CTG ATG GTG GAA GGG ACC AGG GCC AAT 1322

Leu Thr Asp Trp Asp Pro Gln Lys Asp Leu Ile Leu Arg
CTG ACC GAC TGG GAT CCC CAG AAG GAC CTG ATT TTG CGT 1361

Val Cys Ala Ser Asn Ala Ile Gly Asp Gly Pro Trp Ser
GTG TGT GCC TCC AAT GCA ATT GGT GAT GGG CCC TGG AGT 1400

Gln Pro Leu Val Val Ser Ser His Asp His Ala Gly Arg
CAG CCA CTG GTG GTG TCT TCT CAT GAC CAT GCA GGG AGG 1439

Gln Gly Pro Pro His Ser Arg Thr Ser Trp Val Pro Val
CAG GGC CCT CCC CAC AGC CGC ACA TCC TGG GTG CCT GTG 1478

Val Leu Gly Val Leu Thr Ala Leu Ile Thr Ala Ala Ala
GTC CTG GGC GTG CTC ACC GCC CTG ATC ACA GCT GCT GCC 1517

Leu Ala Leu Ile Leu Leu Arg Lys Arg Arg Lys Glu Thr
TTG GCC CTC ATC CTG CTT CGG AAG AGA CGC AAG GAG ACG 1556

Arg Phe Gly Gln Ala Phe Asp Ser Val Met Ala Arg Gly
CGT TTC GGG CAA GCC TTT GAC AGT GTC ATG GCC CGA GGG 1595

Glu Pro Ala Val His Phe Arg Ala Ala Arg Ser Phe Asn
GAG CCA GCT GTA CAC TTC CGG GCA GCC CGA TCT TTC AAT 1634

Arg Glu Arg Pro Glu Arg Ile Glu Ala Thr Leu Asp Ser
CGA GAA AGG CCT GAA CGC ATT GAG GCC ACA TTG GAT AGC 1673

Leu Gly Ile Ser Asp Glu Leu Lys Glu Lys Leu Glu Asp
CTG GGC ATC AGC GAT GAA TTG AAG GAA AAG CTG GAG GAT 1712

Val Leu Ile Pro Glu Gln Gln Phe Thr Leu Gly Arg Met
GTC CTC ATT CCA GAG CAG CAG TTC ACC CTC GGT CGG ATG 1751

Leu Gly Lys Gly Glu Phe Gly Ser Val Arg Glu Ala Gln
TTG GGC AAA GGA GAG TTT GGA TCA GTG CGG GAA GCC CAG 1790

Leu Lys Gln Glu Asp Gly Ser Phe Val Lys Val Ala Val
CTA AAG CAG GAA GAT GGC TCC TTC GTG AAA GTG GCA GTG 1829

Lys Met Leu Lys Ala Asp Ile Ile Ala Ser Ser Asp Ile
AAG ATG CTG AAA GCT GAC ATC ATT GCC TCA AGC GAC ATA 1868

Glu Glu Phe Leu Arg Glu Ala Ala Cys Met Lys Glu Phe
GAA GAG TTC CTC CGG GAA GCA GCT TGC ATG AAG GAG TTT 1907

Asp His Pro His Val Ala Lys Leu Val Gly Val Ser Leu
GAC CAT CCA CAC GTG GCC AAG CTT GTT GGG GTG AGC CTC 1946

Arg Ser Arg Ala Lys Gly Arg Leu Pro Ile Pro Met Val
CGG AGC AGG GCT AAA GGT CGT CTC CCC ATT CCC ATG GTC 1985

Ile Leu Pro Phe Met Lys His Gly Asp Leu His Ala Phe
ATC CTG CCC TTC ATG AAA CAT GGA GAC TTG CAC GCC TTT 2024

Leu Leu Ala Ser Arg Ile Gly Glu Asn Pro Phe Asn Leu
CTG CTC GCC TCC CGA ATC GGG GAG AAC CCT TTT AAC CTG 2063

Pro Leu Gln Thr Leu Val Arg Phe Met Val Asp Ile Ala
CCC CTG CAG ACC CTG GTC CGG TTC ATG GTG GAC ATT GCC 2102

Cys Gly Met Glu Tyr Leu Ser Ser Arg Asn Phe Ile His
TGT GGC ATG GAG TAC CTG AGC TCC CGG AAC TTC ATC CAC 2141

Arg Asp Leu Ala Ala Arg Asn Cys Met Leu Ala Glu Asp
CGA GAC CTA GCA GCT CGG AAT TGC ATG CTG GCC GAG GAC 2180

Met Thr Val Cys Val Ala Asp Phe Gly Leu Ser Arg Lys
ATG ACA GTG TGT GTG GCT GAT TTT GGA CTC TCT CGG AAA 2219

Ile Tyr Ser Gly Asp Tyr Tyr Arg Gln Gly Cys Ala Ser
ATC TAT AGC GGG GAC TAT TAT CGT CAG GGC TGT GCC TCC 2258

Lys Leu Pro Val Lys Trp Leu Ala Leu Glu Ser Leu Ala
AAA TTG CCC GTC AAG TGG CTG GCC CTG GAG AGC TTG GCT 2297

Asp Asn Leu Tyr Thr Val His Ser Asp Val Trp Ala Phe
GAC AAC TTG TAT ACT GTA CAC AGT GAT GTG TGG GCC TTC 2336

Gly Val Thr Met Trp Glu Ile Met Thr Arg Gly Gln Thr
GGG GTG ACC ATG TGG GAG ATC ATG ACT CGT GGG CAG ACG 2375

Pro Tyr Ala Gly Ile Glu Asn Ala Glu Ile Tyr Asn Tyr
CCA TAT GCT GGC ATT GAA AAT GCT GAG ATT TAC AAC TAC 2414

Leu Ile Gly Gly Asn Arg Leu Lys Gln Pro Pro Glu Cys
CTC ATC GGC GGG AAC CGC CTG AAG CAG CCT CCG GAG TGC 2453

Met Glu Glu Val Tyr Asp Leu Met Tyr Gln Cys Trp Ser
ATG GAG GAA GTG TAT GAT CTC ATG TAC CAG TGC TGG AGC 2492

Ala Asp Pro Lys Gln Arg Pro Ser Phe Thr Cys Leu Arg
GCC GAC CCC AAG CAG CGC CCA AGC TTC ACG TGT CTG CGA 2531

Met Glu Leu Glu Asn Ile Leu Gly His Leu Ser Val Leu
ATG GAA CTG GAG AAC ATT CTG GGC CAC CTG TCT GTG CTG 2570

Ser Thr Ser Gln Asp Pro Leu Tyr Ile Asn Ile Glu Arg
TCC ACC AGC CAG GAC CCC TTG TAC ATC AAC ATT GAG AGA 2609

Ala Glu Gln Pro Thr Glu Ser Gly Ser Pro Glu Leu His
GCT GAG CAG CCT ACT GAG AGT GGC AGC CCT GAG CTG CAC 2648

Cys Gly Glu Arg Ser Ser Ser Glu Ala Gly Asp Gly Ser
TGT GGA GAG CGA TCC AGC AGC GAG GCA GGG GAC GGC AGT 2687

Gly Val Gly Ala Val Gly Gly Ile Pro Ser Asp Ser Arg
GGC GTG GGG GCA GTA GGT GGC ATC CCC AGT GAC TCT CGG 2726

Tyr Ile Phe Ser Pro Gly Gly Leu Ser Glu Ser Pro Gly
TAC ATC TTC AGC CCC GGA GGG CTA TCC GAG TCA CCA GGG 2765

Gln Leu Glu Gln Gln Pro Glu Ser Pro Leu Asn Glu Asn
CAG CTG GAG CAG CAG CCA GAA AGC CCC CTC AAT GAG AAC 2804

Gln Arg Leu Leu Leu Leu Gln Gln Gly Leu Leu Pro His
CAG AGG CTG TTG TTG CTG CAG CAA GGG CTA CTG CCT CAC 2843

Ser Ser Cys
AGT AGC TGT 2852

TAACCCTCAGGCAGAGGAAAGTTGGGGCCCCTGGCTCTGCTGACCACTGTGCTGCCTGAC 2912

TAGGCCCAGTCTGATCACAGCCCAGGCAGCAAGGTATGGAGGCTCCTGTGGTAGCCCTCC 2972

CAAGCTGTGCTGGCGCCTGGACGGACCAAATTGCCCAATCCCAGTTCTTCCTGCAGCCGC 3032

TCTGGCCAGCCTGGCATCAGTTCAGGCCTTGGCTTAGAGGAGGTGAGCCAGAGCTGGTTG 3092

CCTGAATGCAGGCAGCTGGCAGGAGGGGAGGGTGGCTATGTTTCCATGGGTACCATGGGT 3152

GTGGATGGCAGTAAGGGAGGGTAGCAACAGCCCTGTGGGCCCCTACCCTCCTGGCTGAGC 3212 

TGCTCCTACTTTAGTGCATGCTTGGAGCCGCCTGCAGCCTGGAACTCAGCACTGCCCACC 3272 

ACACTTGGGCCGAAATGCCAGGTTTGCCCCTCTTAAGTCACAAAGAGATGTCCATGTATT 3332

GTTCCCTTTTAGGTGATGATTAGGAAGGGATTGGCACACTTGGGTCCCTAAGCCCTATGG 3392

CAGuAAATGGTGGGATATTCTCAGGTCTGAATCCTCATCATCTTCCTGATTCCCCACCCT 3452 

GCAAAGGCCTGGAACTGGCTGTGGGGCTCTGAGGCATGCTGAAGGACAAAAGATTACAGA 3512 

GATCCGACTTCAAAAGGCAGGGTCTGAGTCTGGCAGGTGGAGAGGTGCTAAGGGGCTGGC 3572

CCAGGAGTCAGGCATTTCAGGACCCCTCCAAGCTTCTACAGTCTGTCTGAGCATGCTACC 3632

AAGCCCCCAGATACCCCAAAACTAACAGAGGCAGTTTTGTCTGAGCCCAGCCCTCCCACA 3692 

TGATGACCCTTAGGTCTACCCTCCTCTCTAAATGGACATCCTCGTTTGTCCCAAGTCTCC 3752 

AGAGAGACTACTGATGGCTGATGTGGGTAAGAAAAGTTCCAGGAACCAGGGCTGGGGTGG 3812

AACCAGGGCTGGGGTCGAGGCAGGCTCTTGGGCAGGCTCTTGCTGTTAGGAACATTTCTA 3872 

AGCTATTAAGTTGCTGTTTCAAAACAAATAAAATTGAAACATAAAGA n 3919 



(9) INFORMATION FOR SEQUENCE ID NO. 8: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2550 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(2) MOLECULE TYPE: cDNA 

(3) SEQUENCE DESCRIPTION: SEQ ID NO. 8: 

Ala Gly
GCA GGC 6

Leu Lys Leu Met Gly Ala Pro Val Lys Met Thr Val Ser
CTG AAG CTC ATG GGC GCC CCA GTG AAG ATG ACC GTG TCT 45

Gln Gly Gln Pro Val Lys Leu Asn Cys Ser Val Glu Gly
CAG GGG CAG CCA GTG AAG CTC AAC TGC AGC GTG GAG GGG 84

Met Glu Asp Pro Asp Ile His Trp Met Lys Asp Gly Thr
ATG GAG GAC CCT GAC ATC CAC TGG ATG AAG GAT GGC ACC 123

Val Val Gln Asn Ala Ser Gln Val Ser Ile Ser Ile Ser
GTG GTC CAG AAT GCA AGC CAG GTG TCC ATC TCC ATC AGC 162

Glu His Ser Trp Ile Gly Leu Leu Ser Leu Lys Ser Val
GAG CAC AGC TGG ATT GGC TTA CTC AGC CTA AAG TCA GTG 201

Glu Arg Ser Asp Ala Gly Leu Tyr Trp Cys Gln Val Lys
GAG CGG TCT GAT GCT GGC CTG TAC TGG TGC CAG GTG AAG 240

Asp Gly Glu Glu Thr Lys Ile Ser Gln Ser Val Trp Leu
GAT GGG GAG GAA ACC AAG ATC TCT CAG TCA GTA TGG CTC 279

Thr Val Glu Gly Val Pro Phe Phe Thr Val Glu Pro Lys
ACT GTC GAA GGT GTG CCA TTC TTC ACA GTG GAA CCA AAA 318

Asp Leu Ala Val Pro Pro Asn Ala Pro Phe Gln Leu Ser
GAT CTG GCG GTG CCA CCC AAT GCC CCT TTT CAG CTG TCT 357

Cys Glu Ala Val Gly Pro Pro Glu Pro Val Thr Ile Tyr
TGT GAG GCT GTG GGT CCT CCA GAA CCC GTA ACC ATT TAC 396

Trp Trp Arg Gly Leu Thr Lys Val Gly Gly Pro Ala Pro
TGG TGG AGA GGA CTC ACT AAA GTT GGG GGA CCT GCT CCC 435

Ser Pro Ser Val Leu Asn Val Thr Gly Val Thr Gln Arg
TCT CCC TCT GTT TTA AAT GTG ACA GGA GTG ACC CAG CGC 474

Thr Glu Phe Ser Cys Glu Ala Arg Asn Ile Lys Gly Leu
ACA GAG TTT TCT TGT GAA GCC CGC AAC ATA AAA GGC CTG 513

Ala Thr Ser Arg Pro Ala Ile Val Arg Leu Gln Ala Pro
GCC ACT TCC CGA CCA GCC ATT GTT CGC CTT CAA GCA CCG 552

Pro Ala Ala Pro Phe Asn Thr Thr Val Thr Thr Ile Ser
CCT GCA GCT CCT TTC AAC ACC ACA GTA ACA ACG ATC TCC 591

Ser Tyr Asn Ala Ser Val Ala Trp Val Pro Gly Ala Asp
AGC TAC AAC GCT AGC GTG GCC TGG GTG CCA GGT GCT GAC 630

Gly Leu Ala Leu Leu His Ser Cys Thr Val Gln Val Ala
GGC CTA GCT CTG CTG CAT TCC TGT ACT GTA CAG GTG GCA 669

His Ala Pro Gly Glu Trp Glu Ala Leu Ala Val Val Val
CAC GCC CCA GGA GAA TGG GAG GCC CTT GCT GTT GTG GTT 708

Pro Val Pro Pro Phe Thr Cys Leu Leu Arg Asn Leu Ala
CCT GTG CCA CCT TTT ACC TGC CTG CTT CGG AAC TTG GCC 747

Pro Ala Thr Asn Tyr Ser Leu Arg Val Arg Cys Ala Asn
CCT GCC ACC AAC TAC AGC CTT AGG GTG CGC TGT GCC AAT 786

Ala Leu Gly Pro Ser Pro Tyr Gly Asp Trp Val Pro Phe
GCC TTG GGC CCT TCT CCC TAC GGC GAC TGG GTG CCC TTT 825

Gln Thr Lys Gly Leu Ala Pro Ala Arg Ala Pro Gln Asn
CAG ACA AAG GGC CTA GCG CCA GCC AGA GCT CCT CAG AAT 864

Phe His Ala Ile Arg Thr Asp Ser Gly Leu Ile Leu Glu
TTC CAT GCC ATT CGT ACC GAC TCA GGC CTT ATC CTG GAA 903

Trp Glu Glu Val Ile Pro Glu Asp Pro Gly Glu Gly Pro
TGG GAA GAA GTG ATT CCT GAG GAC CCT GGG GAA GGC CCC 942

Leu Gly Pro Tyr Lys Leu Ser Trp Val Gln Glu Asn Gly
CTA GGA CCT TAT AAG CTG TCC TGG GTC CAA GAA AAT GGA 981

Thr Gln Asp Glu Leu Met Val Glu Gly Thr Arg Ala Asn
ACC CAG GAT GAG CTG ATG GTG GAA GGG ACC AGG GCC AAT 1020

Leu Thr Asp Trp Asp Pro Gln Lys Asp Leu Ile Leu Arg
CTG ACC GAC TGG GAT CCC CAG AAG GAC CTG ATT TTG CGT 1059

Val Cys Ala Ser Asn Ala Ile Gly Asp Gly Pro Trp Ser
GTG TGT GCC TCC AAT GCA ATT GGT GAT GGG CCC TGG AGT 1098

Gln Pro Leu Val Val Ser Ser His Asp His Ala Gly Arg
CAG CCA CTG GTG GTG TCT TCT CAT GAC CAT GCA GGG AGG 1137

Gln Gly Pro Pro His Ser Arg Thr Ser Trp Val Pro Val
CAG GGC CCT CCC CAC AGC CGC ACA TCC TGG GTG CCT GTG 1176

Val Leu Gly Val Leu Thr Ala Leu Ile Thr Ala Ala Ala
GTC CTG GGC GTG CTC ACC GCC CTG ATC ACA GCT GCT GCC 1215

Leu Ala Leu Ile Leu Leu Arg Lys Arg Arg Lys Glu Thr
TTG GCC CTC ATC CTG CTT CGG AAG AGA CGC AAG GAG ACG 1254

Arg Phe Gly Gln Ala Phe Asp Ser Val Met Ala Arg Gly
CGT TTC GGG CAA GCC TTT GAC AGT GTC ATG GCC CGA GGG 1293

Glu Pro Ala Val His Phe Arg Ala Ala Arg Ser Phe Asn
GAG CCA GCT GTA CAC TTC CGG GCA GCC CGA TCT TTC AAT 1332

Arg Glu Arg Pro Glu Arg Ile Glu Ala Thr Leu Asp Ser
CGA GAA AGG CCT GAA CGC ATT GAG GCC ACA TTG GAT AGC 1371

Leu Gly Ile Ser Asp Glu Leu Lys Glu Lys Leu Glu Asp
CTG GGC ATC AGC GAT GAA TTG AAG GAA AAG CTG GAG GAT 1410

Val Leu Ile Pro Glu Gln Gln Phe Thr Leu Gly Arg Met
GTC CTC ATT CCA GAG CAG CAG TTC ACC CTC GGT CGG ATG 1449

Leu Gly Lys Gly Glu Phe Gly Ser Val Arg Glu Ala Gln
TTG GGC AAA GGA GAG TTT GGA TCA GTG CGG GAA GCC CAG 1488

Leu Lys Gln Glu Asp Gly Ser Phe Val Lys Val Ala Val
CTA AAG CAG GAA GAT GGC TCC TTC GTG AAA GTG GCA GTG 1527

Lys Met Leu Lys Ala Asp Ile Ile Ala Ser Ser Asp Ile
AAG ATG CTG AAA GCT GAC ATC ATT GCC TCA AGC GAC ATA 1566

Glu Glu Phe Leu Arg Glu Ala Ala Cys Met Lys Glu Phe
GAA GAG TTC CTC CGG GAA GCA GCT TGC ATG AAG GAG TTT 1605

Asp His Pro His Val Ala Lys Leu Val Gly Val Ser Leu
GAC CAT CCA CAC GTG GCC AAG CTT GTT GGG GTG AGC CTC 1644

Arg Ser Arg Ala Lys Gly Arg Leu Pro Ile Pro Met Val
CGG AGC AGG GCT AAA GGT CGT CTC CCC ATT CCC ATG GTC 1683

Ile Leu Pro Phe Met Lys His Gly Asp Leu His Ala Phe
ATC CTG CCC TTC ATG AAA CAT GGA GAC TTG CAC GCC TTT 1722

Leu Leu Ala Ser Arg Ile Gly Glu Asn Pro Phe Asn Leu
CTG CTC GCC TCC CGA ATC GGG GAG AAC CCT TTT AAC CTG 1761

Pro Leu Gln Thr Leu Val Arg Phe Met Val Asp Ile Ala
CCC CTG CAG ACC CTG GTC CGG TTC ATG GTG GAC ATT GCC 1800

Cys Gly Met Glu Tyr Leu Ser Ser Arg Asn Phe Ile His
TGT GGC ATG GAG TAC CTG AGC TCC CGG AAC TTC ATC CAC 1839

Arg Asp Leu Ala Ala Arg Asn Cys Met Leu Ala Glu Asp
CGA GAC CTA GCA GCT CGG AAT TGC ATG CTG GCC GAG GAC 1878

Met Thr Val Cys Val Ala Asp Phe Gly Leu Ser Arg Lys
ATG ACA GTG TGT GTG GCT GAT TTT GGA CTC TCT CGG AAA 1917

Ile Tyr Ser Gly Asp Tyr Tyr Arg Gln Gly Cys Ala Ser
ATC TAT AGC GGG GAC TAT TAT CGT CAG GGC TGT GCC TCC 1956

Lys Leu Pro Val Lys Trp Leu Ala Leu Glu Ser Leu Ala
AAA TTG CCC GTC AAG TGG CTG GCC CTG GAG AGC TTG GCT 1995

Asp Asn Leu Tyr Thr Val His Ser Asp Val Trp Ala Phe
GAC AAC TTG TAT ACT GTA CAC AGT GAT GTG TGG GCC TTC 2034

Gly Val Thr Met Trp Glu Ile Met Thr Arg Gly Gln Thr
GGG GTG ACC ATG TGG GAG ATC ATG ACT CGT GGG CAG ACG 2073

Pro Tyr Ala Gly Ile Glu Asn Ala Glu Ile Tyr Asn Tyr
CCA TAT GCT GGC ATT GAA AAT GCT GAG ATT TAC AAC TAC 2112

Leu Ile Gly Gly Asn Arg Leu Lys Gln Pro Pro Glu Cys
CTC ATC GGC GGG AAC CGC CTG AAG CAG CCT CCG GAG TGC 2151

Met Glu Glu Val Tyr Asp Leu Met Tyr Gln Cys Trp Ser
ATG GAG GAA GTG TAT GAT CTC ATG TAC CAG TGC TGG AGC 2190

Ala Asp Pro Lys Gln Arg Pro Ser Phe Thr Cys Leu Arg
GCC GAC CCC AAG CAG CGC CCA AGC TTC ACG TGT CTG CGA 2229

Met Glu Leu Glu Asn Ile Leu Gly His Leu Ser Val Leu
ATG GAA CTG GAG AAC ATT CTG GGC CAC CTG TCT GTG CTG

Ser Thr Ser Gln Asp Pro Leu Tyr Ile Asn Ile Glu Arg
TCC ACC AGC CAG GAC CCC TTG TAC ATC AAC ATT GAG AGA

Ala Glu Gln Pro Thr Glu Ser Gly Ser Pro Glu Leu His
GCT GAG CAG CCT ACT GAG AGT GGC AGC CCT GAG CTG CAC

Cys Gly Glu Arg Ser Ser Ser Glu Ala Gly Asp Gly Ser
TGT GGA GAG CGA TCC AGC AGC GAG GCA GGG GAC GGC AGT

Gly Val Gly Ala Val Gly Gly Ile Pro Ser Asp Ser Arg
GGC GTG GGG GCA GTA GGT GGC ATC CCC AGT GAC TCT CGG

Tyr Ile Phe Ser Pro Gly Gly Leu Ser Glu Ser Pro Gly
TAC ATC TTC AGC CCC GGA GGG CTA TCC GAG TCA CCA GGG

Gln Leu Glu Gln Gln Pro Glu Ser Pro Leu Asn Glu Asn
CAG CTG GAG CAG CAG CCA GAA AGC CCC CTC AAT GAG AAC

Gln Arg Leu Leu Leu Leu Gln Gln Gly Leu Leu Pro His
CAG AGG CTG TTG TTG CTG CAG CAA GGG CTA CTG CCT CAC

Ser Ser Cys
AGT AGC TGT



(10) INFORMATION FOR SEQUENCE ID NO. 9: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4364 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(2) MOLECULE TYPE: cDNA 

(3) SEQUENCE DESCRIPTION: SEQ ID NO. 9: 

CATTAGATCTTTACATGAAAGTAAAATTTATAAGATTTCTAGAAAGTCAAAAGATGATAA 
CTATTTCTTAGGATACTAAAAGCACTCACATTATAGAAAAAAAATCAGTTAACTATACTC
CACAAACATTAAAGGCTCCCTATAAAAAAACATTTTTAATAGGCAAGCCACAGAAAGGGC 
AAATATTAATAGTTTGCAATACATATGTATGAAAAGGAATTGAATCTAGAATATTTAACA 
AAGCTTTACAACTCAAAAAATACAAAGAAAATATTTTTCTTCCAATTGGCAAATTACTTA 
AACAGAACCTTCACAAAAGAAGATAAGAATGTTTAATAAACATTTGAAGCCATAATAATG 
ACATCATTAGCCATGATGGAAATGCAAATTTAAGTACCACTTCACATCCACAAGAAAAAG 
ATAAAAATAAAAGGACTGAGCTCACCAAACATTGGTGAGGATGTGGTAATACTGAAATTC 
TTGTACCGTGCTCCTGAGGGTATAACATATTACAGGATTTTTTTGAAAACTAGTGGTTCC 
TTATAAACTTAATGCCCTGGCAACCTCACACCTATTTACTTAAGAATGAAAGGGCCCCGC 

CCTCCTCCCTCCTCGCTCGCGGGCCGGGCCCGGCATGGTGCGGCGTCGCCGCCGATGGCG 
CTGAGGCGGAGC 

Met Gly Arg Pro Gly Leu Pro Pro Leu Pro Leu Pro Pro 
ATG GGG CGG CCG GGG CTC CCG CCG CTG CCG CTG CCG CCG 

Pro Pro Arg Leu Gly Leu Leu Leu Ala Glu Ser Ala Ala 
CCA CCG CGG CTC GGG CTG CTG CTG GCG GAG TCC GCC GCC 

Ala Gly Leu Lys Leu Met Gly Ala Pro Val Lys Leu Thr 
GCA GGT CTG AAG CTC ATG GGA GCC CCG GTG AAG CTG ACA 

Val Ser Gln Gly Gln Pro Val Lys Leu Asn Cys Ser Val
GTG TCT CAG GGG CAG CCG GTG AAG CTC AAC TGC AGT GTG 
Glu Gly Met Glu Glu Pro Asp Ile Gln Trp Val Lys Asp
GAG GGG ATG GAG GAG CCT GAC ATC CAG TGG GTG AAG GAT 867

Gly Ala Val Val Gln Asn Leu Asp Gln Leu Tyr Ile Pro
GGG GCT GTG GTC CAG AAC TTG GAC CAG TTG TAC ATC CCA 906

Val Ser Glu Gln His Trp Ile Gly Phe Leu Ser Leu Lys
GTC AGC GAG CAG CAC TGG ATC GGC TTC CTC AGC CTG AAG 945

Ser Val Glu Arg Ser Asp Ala Gly Arg Tyr Trp Cys Gln
TCA GTG GAG CGC TCT GAC GCC GGC CGG TAC TGG TGC CAG 984

Val Glu Asp Gly Gly Glu Thr Glu Ile Ser Gln Pro Val
GTG GAG GAT GGG GGT GAA ACC GAG ATC TCC CAG CCA GTG 1023

Trp Leu Thr Val Glu Gly Val Pro Phe Phe Thr Val Glu
TGG CTC ACG GTA GAA GGT GTG CCA TTT TTC ACA GTG GAG 1062

Pro Lys Asp Leu Ala Val Pro Pro Asn Ala Pro Phe Gln
CCA AAA GAT CTG GCA GTG CCA CCC AAT GCC CCT TTC CAA 1101

Leu Ser Cys Glu Ala Val Gly Pro Pro Glu Pro Val Thr
CTG TCT TGT GAG GCT GTG GGT CCC CCT GAA CCT GTT ACC 1140

Ile Val Trp Trp Arg Gly Thr Thr Lys Ile Gly Gly Pro
ATT GTC TGG TGG AGA GGA ACT ACG AAG ATC GGG GGA CCC 1179

Ala Pro Ser Pro Ser Val Leu Asn Val Thr Gly Val Thr
GCT CCC TCT CCA TCT GTT TTA AAT GTA ACA GGG GTG ACC 1218

Gln Ser Thr Met Phe Ser Cys Glu Ala His Asn Leu Lys
CAG AGC ACC ATG TTT TCC TGT GAA GCT CAC AAC CTA AAA 1257

Gly Leu Ala Ser Ser Arg Thr Ala Thr Val His Leu Gln
GGC CTG GCC TCT TCT CGC ACA GCC ACT GTT CAC CTT CAA 1296

Ala Leu Pro Ala Ala Pro Phe Asn Ile Thr Val Thr Lys
GCA CTG CCT GCA GCC CCC TTC AAC ATC ACC GTG ACA AAG 1335

Leu Ser Ser Ser Asn Ala Ser Val Ala Trp Met Pro Gly
CTT TCC AGC AGC AAC GCT AGT GTG GCC TGG ATG CCA GGT 1374

Ala Asp Gly Arg Ala Leu Leu Gln Ser Cys Thr Val Gln
GCT GAT GGC CGA GCT CTG CTA CAG TCC TGT ACA GTT CAG 1413

Val Thr Gln Ala Pro Gly Gly Trp Glu Val Leu Ala Val
GTG ACA CAG GCC CCA GGA GGC TGG GAA GTC CTG GCT GTT 1452

Val Val Pro Val Pro Pro Phe Thr Cys Leu Leu Arg Asp
GTG GTC CCT GTG CCC CCC TTT ACC TGC CTG CTC CGG GAC 1491

Leu Val Pro Ala Thr Asn Tyr Ser Leu Arg Val Arg Cys
CTG GTG CCT GCC ACC AAC TAC AGC CTC AGG GTG CGC TGT 1530

Ala Asn Ala Leu Gly Pro Ser Pro Tyr Ala Asp Trp Val
GCC AAT GCC TTG GGG CCC TCT CCC TAT GCT GAC TGG GTG 1569

Pro Phe Gln Thr Lys Gly Leu Ala Pro Ala Ser Ala Pro
CCC TTT CAG ACC AAG GGT CTA GCC CCA GCC AGC GCT CCC 1608

Gln Asn Leu His Ala Ile Arg Thr Asp Ser Gly Leu Ile
CAA AAC CTC CAT GCC ATC CGC ACA GAT TCA GGC CTC ATC 1647

Leu Glu Trp Glu Glu Val Ile Pro Glu Ala Pro Leu Glu
TTG GAG TGG GAA GAA GTG ATC CCC GAG GCC CCT TTG GAA 1686

Gly Pro Leu Gly Pro Tyr Lys Leu Ser Trp Val Gln Asp
GGC CCC CTG GGA CCC TAC AAA CTG TCC TGG GTT CAA GAC 1725

Asn Gly Thr Gln Asp Glu Leu Thr Val Glu Gly Thr Arg
AAT GGA ACC CAG GAT GAG CTG ACA GTG GAG GGG ACC AGG 1764

Ala Asn Leu Thr Gly Trp Asp Pro Gln Lys Asp Leu Ile
GCC AAT TTG ACA GGC TGG GAT CCC CAA AAG GAC CTG ATC 1803

Val Arg Val Cys Val Ser Asn Ala Val Gly Cys Gly Pro
GTA CGT GTG TGC GTC TCC AAT GCA GTT GGC TGT GGA CCC 1842

Trp Ser Gln Pro Leu Val Val Ser Ser His Asp Arg Ala
TGG AGT CAG CCA CTG GTG GTC TCT TCT CAT GAC CGT GCA 1881

Gly Gln Gln Gly Pro Pro His Ser Arg Thr Ser Trp Val
GGC CAG CAG GGC CCT CCT CAC AGC CGC ACA TCC TGG GTA 1920

Pro Val Val Leu Gly Val Leu Thr Ala Leu Val Thr Ala
CCT GTG GTC CTT GGT GTG CTA ACG GCC CTG GTG ACG GCT 1959

Ala Ala Leu Ala Leu Ile Leu Leu Arg Lys Arg Arg Lys
GCT GCC CTG GCC CTC ATC CTG CTT CGA AAG AGA CGG AAA 1998

Glu Thr Arg Phe Gly Gln Ala Phe Asp Ser Val Met Ala
GAG ACG CGG TTT GGG CAA GCC TTT GAC AGT GTC ATG GCC 2037

Arg Gly Glu Pro Ala Val His Phe Arg Ala Ala Arg Ser
CGG GGA GAG CCA GCC GTT CAC TTC CGG GCA GCC CGG TCC 2076

Phe Asn Arg Glu Arg Pro Glu Arg Ile Glu Ala Thr Leu
TTC AAT CGA GAA AGG CCC GAG CGC ATC GAG GCC ACA TTG 2115

Asp Ser Leu Gly Ile Ser Asp Glu Leu Lys Glu Lys Leu
GAC AGC TTG GGC ATC AGC GAT GAA CTA AAG GAA AAA CTG 2154

Glu Asp Val Leu Ile Pro Glu Gln Gln Phe Thr Leu Gly
GAG GAT GTG CTC ATC CCA GAG CAG CAG TTC ACC CTG GGC 2193

Arg Met Leu Gly Lys Gly Glu Phe Gly Ser Val Arg Glu
CGG ATG TTG GGC AAA GGA GAG TTT GGT TCA GTG CGG GAG 2232

Ala Gln Leu Lys Gln Glu Asp Gly Ser Phe Val Lys Val
GCC CAG CTG AAG CAA GAG GAT GGC TCC TTT GTG AAA GTG 2271

Ala Val Lys Met Leu Lys Ala Asp Ile Ile Ala Ser Ser
GCT GTG AAG ATG CTG AAA GCT GAC ATC ATT GCC TCA AGC 2310

Asp Ile Glu Glu Phe Leu Arg Glu Ala Ala Cys Met Lys
GAC ATT GAA GAG TTC CTC AGG GAA GCA GCT TGC ATG AAG 2349

Glu Phe Asp His Pro His Val Ala Lys Leu Val Gly Val
GAG TTT GAC CAT CCA CAC GTG GCC AAA CTT GTT GGG GTA 2388

Ser Leu Arg Ser Arg Ala Lys Gly Arg Leu Pro Ile Pro
AGC CTC CGG AGC AGG GCT AAA GGC CGT CTC CCC ATC CCC 2427

Met Val Ile Leu Pro Phe Met Lys His Gly Asp Leu His
ATG GTC ATC TTG CCC TTC ATG AAG CAT GGG GAC CTG CAT 2466

Ala Phe Leu Leu Ala Ser Arg Ile Gly Glu Asn Pro Phe
GCC TTC CTG CTC GCC TCC CGG ATT GGG GAG AAC CCC TTT 2505

Asn Leu Pro Leu Gln Thr Leu Ile Arg Phe Met Val Asp
AAC CTA CCC CTC CAG ACC CTG ATC CGG TTC ATG GTG GAC 2544

Ile Ala Cys Gly Met Glu Tyr Leu Ser Ser Arg Asn Phe
ATT GCC TGC GGC ATG GAG TAC CTG AGC TCT CGG AAC TTC 2583

Ile His Arg Asp Leu Ala Ala Arg Asn Cys Met Leu Ala
ATC CAC CGA GAC CTG GCT GCT CGG AAT TGC ATG CTG GCA 2622

Glu Asp Met Thr Val Cys Val Ala Asp Phe Gly Leu Ser
GAG GAC ATG ACA GTG TGT GTG GCT GAC TTC GGA CTC TCC 2661

Arg Lys Ile Tyr Ser Gly Asp Tyr Tyr Arg Gln Gly Cys
CGG AAG ATC TAC AGT GGG GAC TAC TAT CGT CAA GGC TGT 2700

Ala Ser Lys Leu Pro Val Lys Trp Leu Ala Leu Glu Ser
GCC TCC AAA CTG CCT GTC AAG TGG CTG GCC CTG GAG AGC 2739

Leu Ala Asp Asn Leu Tyr Thr Val Gln Ser Asp Val Trp
CTG GCC GAC AAC CTG TAT ACT GTG CAG AGT GAC GTG TGG 2778

Ala Phe Gly Val Thr Met Trp Glu Ile Met Thr Arg Gly
GCG TTC GGG GTG ACC ATG TGG GAG ATC ATG ACA CGT GGG 2817

Gln Thr Pro Tyr Ala Gly Ile Glu Asn Ala Glu Ile Tyr
CAG ACG CCA TAT GCT GGC ATC GAA AAC GCT GAG ATT TAC 2856

Asn Tyr Leu Ile Gly Gly Asn Arg Leu Lys Gln Pro Pro
AAC TAC CTC ATT GGC GGG AAC CGC CTG AAA CAG CCT CCG 2895

Glu Cys Met Glu Asp Val Tyr Asp Leu Met Tyr Gln Cys
GAG TGT ATG GAG GAC GTG TAT GAT CTC ATG TAC CAG TGC 2934

Trp Ser Ala Asp Pro Lys Gln Arg Pro Ser Phe Thr Cys
TGG AGT GCT GAC CCC AAG CAG CGC CCG AGC TTT ACT TGT 2973

Leu Arg Met Glu Leu Glu Asn Ile Leu Gly Gln Leu Ser
CTG CGA ATG GAA CTG GAG AAC ATC TTG GGC CAG CTG TCT 3012

Val Leu Ser Ala Ser Gln Asp Pro Leu Tyr Ile Asn Ile
GTG CTA TCT GCC AGC CAG GAC CCC TTA TAC ATC AAC ATC 3051

Glu Arg Ala Glu Glu Pro Thr Val Gly Gly Ser Leu Glu
GAG AGA GCT GAG GAG CCC ACT GTG GGA GGC AGC CTG GAG 3090

Leu Pro Gly Arg Asp Gln Pro Tyr Ser Gly Ala Gly Asp
CTA CCT GGC AGG GAT CAG CCC TAC AGT GGG GCT GGG GAT 3129

Gly Ser Gly Met Gly Ala Val Gly Gly Thr Pro Ser Asp
GGC AGT GGC ATG GGG GCA GTG GGT GGC ACT CCC AGT GAC 3168

Cys Arg Tyr Ile Leu Thr Pro Gly Gly Leu Ala Glu Gln
TGT CGG TAC ATA CTC ACC CCC GGA GGG CTG GCT GAG CAG 3207

Pro Gly Gln Ala Glu His Gln Pro Glu Ser Pro Leu Asn
CCA GGG CAG GCA GAG CAC CAG CCA GAG AGT CCC CTC AAT 3246

Glu Thr Gln Arg Leu Leu Leu Leu Gln Gln Gly Leu Leu
GAG ACA CAG AGG CTT TTG CTG CTG CAG CAA GGG CTA CTG 3285

Pro His Ser Ser Cys
CCA CAC AGT AGC TGT 3300



TAGCCCACAGGCAGAGGGCATCGGGGCCATTTGGCCGGCTCTGGTGGCCACTGAGCTGGC 3360 

TGACTAAGCCCCGTCTGACCCCAGCCCAGACAGCAAGGTGTGGAGGCTCCTGTGGTAGTC 3420 

CTCCCAAGCTGTGCTGGGAAGCCCGGACTGACCAAATCACCCAATCCCAGTTCTTCCTGC 3480 

AACCACTCTGTGGCCAGCCTGGCATCAGTTTAGGCCTTGGCTTGATGGAAGTGGGCCAGT 3540

CCTGGTTGTCTGAACCCAGGCAGCTGGCAGGAGTGGGGTGGTTATGTTTCCATGGTTACC 3600 

ATGGGTGTGGATGGCAGTGTGGGGAGGGCAGGTCCAGCTCTGTGGGCCCTACCCTCCTGC 3660 

TGAGCTGCCCCTGCTGCTTAAGTGCATGCATTGAGCTGCCTCCAGCCTGGTGGCCCAGCT 3720

ATTACCACACTTGGGGTTTAAATATCCAGGTGTGCCCCTCCAAGTCAGAAAGAGATGTCC 3780 

TTGTAATATTCCCTTTTAGGTGAGGGTTGGTAAGGGGTTGGTATCTCAGGTCTGAATCTT 3840 

CACCATCTTTCTGATTCCGCACCCTGCCTACGCCAGGAGAAGTTGAGGGGAGCATGCTTC 3900 

CCTGCAGCTGACCGGGTCACACAAAGGCATGCTGGAGTACCCAGCCTATCAGGTGCCCCT 3960 

CTTCCAAAGGCAGCGTGCCGAGCCAGCAAGAGGAAGGGGTGCTGTGAGGCTTGCCCAGGA 4020 
GCAAGTGAGGCCGGAGAGGAGTTCAGGAACCCTTCTCCATACCCACAATCTGAGCACGCT 4080

ACCAAATCTCAAAATATCCTAAGACTAACAAAGGCAGCTGTGTCTGAGCCCAACCCTTCT 4140 

AAACGGTGACCTTTAGTGCCAACTTCCCCTCTAACTGGACAGCCTCTTCTGTCCCAAGTC 4200 

TCCAGAGAGAAATCAGGCCTGATGAGGGGGAATTCCTGGAACCTGGACCCCAGCCTTGGT 4260 

GGGGGAGCCTCTGGAATGCATGGGGCGGGTCCTAGCTGTTAGGGACATTTCCAAGCTGTT 4320
AGTTGCTGTTTAAAATAGAAATAAAATTGAAGACTAAAGACCTA 4364



(11) INFORMATION FOR SEQUENCE ID NO. 10:

(1) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2550 BASE PAIRS

(B) TYPE: NUCLEIC ACID

(C) STRANDEDNESS: SINGLE

(D) TOPOLOGY: LINEAR

(2) MOLECULE TYPE: cDNA

(3) SEQUENCE DESCRIPTION: SEQ ID NO. 10:



Ala Gly Leu Lys Leu Met Gly Ala Pro Val Lys Leu Thr
GCA GGT CTG AAG CTC ATG GGA GCC CCG GTG AAG CTG ACA 39

Val Ser Gln Gly Gln Pro Val Lys Leu Asn Cys Ser Val
GTG TCT CAG GGG CAG CCG GTG AAG CTC AAC TGC AGT GTG 78

Glu Gly Met Glu Glu Pro Asp Ile Gln Trp Val Lys Asp
GAG GGG ATG GAG GAG CCT GAC ATC CAG TGG GTG AAG GAT 117

Gly Ala Val Val Gln Asn Leu Asp Gln Leu Tyr Ile Pro
GGG GCT GTG GTC CAG AAC TTG GAC CAG TTG TAC ATC CCA 156

Val Ser Glu Gln His Trp Ile Gly Phe Leu Ser Leu Lys
GTC AGC GAG CAG CAC TGG ATC GGC TTC CTC AGC CTG AAG 195

Ser Val Glu Arg Ser Asp Ala Gly Arg Tyr Trp Cys Gln
TCA GTG GAG CGC TCT GAC GCC GGC CGG TAC TGG TGC CAG 234

Val Glu Asp Gly Gly Glu Thr Glu Ile Ser Gln Pro Val
GTG GAG GAT GGG GGT GAA ACC GAG ATC TCC CAG CCA GTG 273

Trp Leu Thr Val Glu Gly Val Pro Phe Phe Thr Val Glu
TGG CTC ACG GTA GAA GGT GTG CCA TTT TTC ACA GTG GAG 312

Pro Lys Asp Leu Ala Val Pro Pro Asn Ala Pro Phe Gln
CCA AAA GAT CTG GCA GTG CCA CCC AAT GCC CCT TTC CAA 351

Leu Ser Cys Glu Ala Val Gly Pro Pro Glu Pro Val Thr
CTG TCT TGT GAG GCT GTG GGT CCC CCT GAA CCT GTT ACC 390

Ile Val Trp Trp Arg Gly Thr Thr Lys Ile Gly Gly Pro
ATT GTC TGG TGG AGA GGA ACT ACG AAG ATC GGG GGA CCC 429

Ala Pro Ser Pro Ser Val Leu Asn Val Thr Gly Val Thr
GCT CCC TCT CCA TCT GTT TTA AAT GTA ACA GGG GTG ACC 468

Gln Ser Thr Met Phe Ser Cys Glu Ala His Asn Leu Lys
CAG AGC ACC ATG TTT TCC TGT GAA GCT CAC AAC CTA AAA 507


Gly 
GGC 


Leu 
CTG 


Ala 
GCC 


Ser 
TCT 


Ser 
TCT 


Arg 
CGC 


Thr 
ACA 


Ala 
GCC 


Thr 
ACT 


Val 
GTT 


His 
CAC 


Leu 
CTT 


Gin 
CAA 


546 
Ala Leu Pro Ala Ala Pro Phe Asn Ile Thr Val Thr Lys

GCA CTG CCT GCA GCC CCC TTC AAC ATC ACC GTG ACA AAG 585

Leu Ser Ser Ser Asn Ala Ser Val Ala Trp Met Pro Gly

CTT TCC AGC AGC AAC GCT AGT GTG GCC TGG ATG CCA GGT 624

Ala Asp Gly Arg Ala Leu Leu Gln Ser Cys Thr Val Gln

GCT GAT GGC CGA GCT CTG CTA CAG TCC TGT ACA GTT CAG 663

Val Thr Gln Ala Pro Gly Gly Trp Glu Val Leu Ala Val

GTG ACA CAG GCC CCA GGA GGC TGG GAA GTC CTG GCT GTT 702

Val Val Pro Val Pro Pro Phe Thr Cys Leu Leu Arg Asp

GTG GTC CCT GTG CCC CCC TTT ACC TGC CTG CTC CGG GAC 741

Leu Val Pro Ala Thr Asn Tyr Ser Leu Arg Val Arg Cys

CTG GTG CCT GCC ACC AAC TAC AGC CTC AGG GTG CGC TGT 780

Ala Asn Ala Leu Gly Pro Ser Pro Tyr Ala Asp Trp Val

GCC AAT GCC TTG GGG CCC TCT CCC TAT GCT GAC TGG GTG 819

Pro Phe Gln Thr Lys Gly Leu Ala Pro Ala Ser Ala Pro

CCC TTT CAG ACC AAG GGT CTA GCC CCA GCC AGC GCT CCC 858

Gln Asn Leu His Ala Ile Arg Thr Asp Ser Gly Leu Ile

CAA AAC CTC CAT GCC ATC CGC ACA GAT TCA GGC CTC ATC 897

Leu Glu Trp Glu Glu Val Ile Pro Glu Ala Pro Leu Glu

TTG GAG TGG GAA GAA GTG ATC CCC GAG GCC CCT TTG GAA 936

Gly Pro Leu Gly Pro Tyr Lys Leu Ser Trp Val Gln Asp

GGC CCC CTG GGA CCC TAC AAA CTG TCC TGG GTT CAA GAC 975

Asn Gly Thr Gln Asp Glu Leu Thr Val Glu Gly Thr Arg

AAT GGA ACC CAG GAT GAG CTG ACA GTG GAG GGG ACC AGG 1014

Ala Asn Leu Thr Gly Trp Asp Pro Gln Lys Asp Leu Ile

GCC AAT TTG ACA GGC TGG GAT CCC CAA AAG GAC CTG ATC 1053

Val Arg Val Cys Val Ser Asn Ala Val Gly Cys Gly Pro

GTA CGT GTG TGC GTC TCC AAT GCA GTT GGC TGT GGA CCC 1092

Trp Ser Gln Pro Leu Val Val Ser Ser His Asp Arg Ala

TGG AGT CAG CCA CTG GTG GTC TCT TCT CAT GAC CGT GCA 1131

Gly Gln Gln Gly Pro Pro His Ser Arg Thr Ser Trp Val

GGC CAG CAG GGC CCT CCT CAC AGC CGC ACA TCC TGG GTA 1170

Pro Val Val Leu Gly Val Leu Thr Ala Leu Val Thr Ala

CCT GTG GTC CTT GGT GTG CTA ACG GCC CTG GTG ACG GCT 1209

Ala Ala Leu Ala Leu Ile Leu Leu Arg Lys Arg Arg Lys

GCT GCC CTG GCC CTC ATC CTG CTT CGA AAG AGA CGG AAA 1248

Glu Thr Arg Phe Gly Gln Ala Phe Asp Ser Val Met Ala

GAG ACG CGG TTT GGG CAA GCC TTT GAC AGT GTC ATG GCC 1287

Arg Gly Glu Pro Ala Val His Phe Arg Ala Ala Arg Ser

CGG GGA GAG CCA GCC GTT CAC TTC CGG GCA GCC CGG TCC 1326

Phe Asn Arg Glu Arg Pro Glu Arg Ile Glu Ala Thr Leu

TTC AAT CGA GAA AGG CCC GAG CGC ATC GAG GCC ACA TTG 1365

Asp Ser Leu Gly Ile Ser Asp Glu Leu Lys Glu Lys Leu

GAC AGC TTG GGC ATC AGC GAT GAA CTA AAG GAA AAA CTG 1404

Glu Asp Val Leu Ile Pro Glu Gln Gln Phe Thr Leu Gly

GAG GAT GTG CTC ATC CCA GAG CAG CAG TTC ACC CTG GGC 1443

Arg Met Leu Gly Lys Gly Glu Phe Gly Ser Val Arg Glu

CGG ATG TTG GGC AAA GGA GAG TTT GGT TCA GTG CGG GAG 1482

Ala Gln Leu Lys Gln Glu Asp Gly Ser Phe Val Lys Val

GCC CAG CTG AAG CAA GAG GAT GGC TCC TTT GTG AAA GTG 1521

Ala Val Lys Met Leu Lys Ala Asp Ile Ile Ala Ser Ser

GCT GTG AAG ATG CTG AAA GCT GAC ATC ATT GCC TCA AGC 1560

Asp Ile Glu Glu Phe Leu Arg Glu Ala Ala Cys Met Lys

GAC ATT GAA GAG TTC CTC AGG GAA GCA GCT TGC ATG AAG 1599

Glu Phe Asp His Pro His Val Ala Lys Leu Val Gly Val

GAG TTT GAC CAT CCA CAC GTG GCC AAA CTT GTT GGG GTA 1638

Ser Leu Arg Ser Arg Ala Lys Gly Arg Leu Pro Ile Pro

AGC CTC CGG AGC AGG GCT AAA GGC CGT CTC CCC ATC CCC 1677

Met Val Ile Leu Pro Phe Met Lys His Gly Asp Leu His

ATG GTC ATC TTG CCC TTC ATG AAG CAT GGG GAC CTG CAT 1716

Ala Phe Leu Leu Ala Ser Arg Ile Gly Glu Asn Pro Phe

GCC TTC CTG CTC GCC TCC CGG ATT GGG GAG AAC CCC TTT 1755

Asn Leu Pro Leu Gln Thr Leu Ile Arg Phe Met Val Asp

AAC CTA CCC CTC CAG ACC CTG ATC CGG TTC ATG GTG GAC 1794

Ile Ala Cys Gly Met Glu Tyr Leu Ser Ser Arg Asn Phe

ATT GCC TGC GGC ATG GAG TAC CTG AGC TCT CGG AAC TTC 1833

Ile His Arg Asp Leu Ala Ala Arg Asn Cys Met Leu Ala

ATC CAC CGA GAC CTG GCT GCT CGG AAT TGC ATG CTG GCA 1872

Glu Asp Met Thr Val Cys Val Ala Asp Phe Gly Leu Ser

GAG GAC ATG ACA GTG TGT GTG GCT GAC TTC GGA CTC TCC 1911

Arg Lys Ile Tyr Ser Gly Asp Tyr Tyr Arg Gln Gly Cys

CGG AAG ATC TAC AGT GGG GAC TAC TAT CGT CAA GGC TGT 1950

Ala Ser Lys Leu Pro Val Lys Trp Leu Ala Leu Glu Ser

GCC TCC AAA CTG CCT GTC AAG TGG CTG GCC CTG GAG AGC 1989

Leu Ala Asp Asn Leu Tyr Thr Val Gln Ser Asp Val Trp

CTG GCC GAC AAC CTG TAT ACT GTG CAG AGT GAC GTG TGG 2028

Ala Phe Gly Val Thr Met Trp Glu Ile Met Thr Arg Gly

GCG TTC GGG GTG ACC ATG TGG GAG ATC ATG ACA CGT GGG 2067

Gln Thr Pro Tyr Ala Gly Ile Glu Asn Ala Glu Ile Tyr

CAG ACG CCA TAT GCT GGC ATC GAA AAC GCT GAG ATT TAC 2106

Asn Tyr Leu Ile Gly Gly Asn Arg Leu Lys Gln Pro Pro

AAC TAC CTC ATT GGC GGG AAC CGC CTG AAA CAG CCT CCG 2145

Glu Cys Met Glu Asp Val Tyr Asp Leu Met Tyr Gln Cys

GAG TGT ATG GAG GAC GTG TAT GAT CTC ATG TAC CAG TGC 2184

Trp Ser Ala Asp Pro Lys Gln Arg Pro Ser Phe Thr Cys

TGG AGT GCT GAC CCC AAG CAG CGC CCG [illegible] [illegible] ACT [illegible] 2223

Leu Arg Met Glu Leu Glu Asn Ile Leu Gly Gln Leu Ser

CTG CGA ATG GAA CTG GAG AAC ATC TTG GGC CAG CTG TCT 2262

Val Leu Ser Ala Ser Gln Asp Pro Leu Tyr Ile Asn Ile

GTG CTA TCT GCC AGC CAG GAC CCC TTA TAC ATC AAC ATC 2301

Glu Arg Ala Glu Glu Pro Thr Val Gly Gly Ser Leu Glu

GAG AGA GCT GAG GAG CCC ACT GTG GGA GGC AGC CTG GAG 2340

Leu Pro Gly Arg Asp Gln Pro Tyr Ser Gly Ala Gly Asp

CTA CCT GGC AGG GAT CAG CCC TAC AGT GGG GCT GGG GAT 2379

Gly Ser Gly Met Gly Ala Val Gly Gly Thr Pro Ser Asp

GGC AGT GGC ATG GGG GCA GTG GGT GGC ACT CCC AGT GAC 2418

Cys Arg Tyr Ile Leu Thr Pro Gly Gly Leu Ala Glu Gln

TGT CGG TAC ATA CTC ACC CCC GGA GGG CTG GCT GAG CAG 2457

Pro Gly Gln Ala Glu His Gln Pro Glu Ser Pro Leu Asn

CCA GGG CAG GCA GAG CAC CAG CCA GAG AGT CCC CTC AAT 2496

Glu Thr Gln Arg Leu Leu Leu Leu Gln Gln Gly Leu Leu

GAG ACA CAG AGG CTT TTG CTG CTG CAG CAA GGG CTA CTG 2535

Pro His Ser Ser Cys

CCA CAC AGT AGC TGT 2550



(12) INFORMATION FOR SEQUENCE ID NO. 11: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1158 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(2) MOLECULE TYPE: cDNA 

(3) SEQUENCE DESCRIPTION: SEQ ID NO. 11: 

Ala Gly 

GCA GGC 6 



Leu Lys Leu Met Gly Ala Pro Val Lys Met Thr Val Ser

CTG AAG CTC ATG GGC GCC CCA GTG AAG ATG ACC GTG TCT 45

Gln Gly Gln Pro Val Lys Leu Asn Cys Ser Val Glu Gly

CAG GGG CAG CCA GTG AAG CTC AAC TGC AGC GTG GAG GGG 84

Met Glu Asp Pro Asp Ile His Trp Met Lys Asp Gly Thr

ATG GAG GAC CCT GAC ATC CAC TGG ATG AAG GAT GGC ACC 123

Val Val Gln Asn Ala Ser Gln Val Ser Ile Ser Ile Ser

GTG GTC CAG AAT GCA AGC CAG GTG TCC ATC TCC ATC AGC 162

Glu His Ser Trp Ile Gly Leu Leu Ser Leu Lys Ser Val

GAG CAC AGC TGG ATT GGC TTA CTC AGC CTA AAG TCA GTG 201

Glu Arg Ser Asp Ala Gly Leu Tyr Trp Cys Gln Val Lys

GAG CGG TCT GAT GCT GGC CTG TAC TGG TGC CAG GTG AAG 240

Asp Gly Glu Glu Thr Lys Ile Ser Gln Ser Val Trp Leu

GAT GGG GAG GAA ACC AAG ATC TCT CAG TCA GTA TGG CTC 279

Thr Val Glu Gly Val Pro Phe Phe Thr Val Glu Pro Lys

ACT GTC GAA GGT GTG CCA TTC TTC ACA GTG GAA CCA AAA 318

Asp Leu Ala Val Pro Pro Asn Ala Pro Phe Gln Leu Ser

GAT CTG GCG GTG CCA CCC AAT GCC CCT TTT CAG CTG TCT 357

Cys Glu Ala Val Gly Pro Pro Glu Pro Val Thr Ile Tyr

TGT GAG GCT GTG GGT CCT CCA GAA CCC GTA ACC ATT TAC 396

Trp Trp Arg Gly Leu Thr Lys Val Gly Gly Pro Ala Pro

TGG TGG AGA GGA CTC ACT AAA GTT GGG GGA CCT GCT CCC 435

Ser Pro Ser Val Leu Asn Val Thr Gly Val Thr Gln Arg

TCT CCC TCT GTT TTA AAT GTG [illegible] GGA GTG ACC CAG CGC 474

Thr Glu Phe Ser Cys Glu Ala Arg Asn Ile Lys Gly Leu

ACA GAG TTT TCT TGT GAA GCC CGC AAC ATA AAA GGC [illegible] 513

Ala Thr Ser Arg Pro Ala Ile Val Arg Leu Gln Ala Pro

GCC ACT TCC CGA CCA GCC ATT GTT CGC CTT CAA GCA [illegible] 552

Pro Ala Ala Pro Phe Asn Thr Thr Val Thr Thr Ile Ser

CCT GCA GCT CCT TTC AAC ACC ACA GTA ACA ACG ATC TCC 591

Ser Tyr Asn Ala Ser Val Ala Trp Val Pro Gly Ala Asp

AGC TAC AAC GCT AGC GTG GCC TGG GTG CCA [illegible] GCT [illegible] 630

Gly Leu Ala Leu Leu His Ser Cys Thr Val Gln Val Ala

GGC CTA GCT CTG CTG CAT TCC TGT ACT [illegible] [illegible] [illegible] GCA 669

His Ala Pro Gly Glu Trp Glu Ala Leu Ala Val Val Val

CAC GCC CCA GGA GAA TGG GAG GCC CTT [illegible] [illegible] GTG GTT 708

Pro Val Pro Pro Phe Thr Cys Leu Leu Arg Asn Leu Ala

CCT GTG CCA CCT TTT ACC TGC CTG CTT [illegible] [illegible] [illegible] GCC 747

Pro Ala Thr Asn Tyr Ser Leu Arg Val Arg Cys Ala Asn

CCT GCC ACC AAC TAC AGC CTT AGG GTG [illegible] TGT GCC AAT 786

Ala Leu Gly Pro Ser Pro Tyr Gly Asp Trp Val Pro Phe

GCC TTG GGC CCT TCT CCC TAC GGC GAC TGG GTG CCC TTT 825

Gln Thr Lys Gly Leu Ala Pro Ala Arg Ala Pro Gln Asn

CAG ACA AAG GGC CTA GCG CCA GCC [illegible] [illegible] CCT CAG AAT 864

Phe His Ala Ile Arg Thr Asp Ser Gly Leu Ile Leu Glu

TTC CAT GCC ATT CGT ACC GAC TCA [illegible] CTT ATC CTG GAA 903

Trp Glu Glu Val Ile Pro Glu Asp Pro Gly Glu Gly Pro

TGG GAA GAA GTG ATT CCT GAG GAC CCT [illegible] [illegible] GGC CCC 942

Leu Gly Pro Tyr Lys Leu Ser Trp Val Gln Glu Asn Gly

CTA GGA CCT TAT AAG CTG TCC TGG GTC CAA GAA [illegible] GGA 981

Thr Gln Asp Glu Leu Met Val Glu Gly Thr Arg Ala Asn

ACC CAG GAT GAG CTG ATG GTG GAA GGG [illegible] [illegible] GCC AAT 1020

Leu Thr Asp Trp Asp Pro Gln Lys Asp Leu Ile Leu Arg

CTG ACC GAC TGG GAT CCC CAG AAG GAC [illegible] [illegible] TTG CGT 1059

Val Cys Ala Ser Asn Ala Ile Gly Asp Gly Pro Trp Ser

GTG TGT GCC TCC AAT GCA ATT GGT GAT GGG CCC TGG AGT 1098

Gln Pro Leu Val Val Ser Ser His Asp His Ala Gly Arg

CAG CCA CTG GTG GTG TCT TCT CAT GAC CAT GCA GGG AGG 1137

Gln Gly Pro Pro His Ser Arg

CAG GGC CCT CCC CAC AGC CGC 1158



(13) INFORMATION FOR SEQUENCE ID NO. 12: 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1158 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 
(C) STRANDEDNESS: SINGLE

(D) TOPOLOGY: LINEAR 

(2) MOLECULE TYPE: cDNA 

(3) SEQUENCE DESCRIPTION: SEQ ID NO. 12: 

Ala Gly Leu Lys Leu Met Gly Ala Pro Val Lys Leu Thr

GCA GGT CTG AAG CTC ATG GGA GCC CCG GTG AAG CTG ACA 39

Val Ser Gln Gly Gln Pro Val Lys Leu Asn Cys Ser Val

GTG TCT CAG GGG CAG CCG GTG AAG CTC AAC TGC AGT GTG 78

Glu Gly Met Glu Glu Pro Asp Ile Gln Trp Val Lys Asp

GAG GGG ATG GAG GAG CCT GAC ATC CAG TGG GTG AAG GAT 117

Gly Ala Val Val Gln Asn Leu Asp Gln Leu Tyr Ile Pro

GGG GCT GTG GTC CAG AAC TTG GAC CAG TTG TAC ATC CCA 156

Val Ser Glu Gln His Trp Ile Gly Phe Leu Ser Leu Lys

GTC AGC GAG CAG CAC TGG ATC GGC TTC CTC AGC CTG AAG 195

Ser Val Glu Arg Ser Asp Ala Gly Arg Tyr Trp Cys Gln

TCA GTG GAG CGC TCT GAC GCC GGC CGG TAC TGG TGC CAG 234

Val Glu Asp Gly Gly Glu Thr Glu Ile Ser Gln Pro Val

GTG GAG GAT GGG GGT GAA ACC GAG ATC TCC CAG CCA GTG 273

Trp Leu Thr Val Glu Gly Val Pro Phe Phe Thr Val Glu

TGG CTC ACG GTA GAA GGT GTG CCA TTT TTC ACA GTG GAG 312

Pro Lys Asp Leu Ala Val Pro Pro Asn Ala Pro Phe Gln

CCA AAA GAT CTG GCA GTG CCA CCC AAT GCC CCT TTC CAA 351

Leu Ser Cys Glu Ala Val Gly Pro Pro Glu Pro Val Thr

CTG TCT TGT GAG GCT GTG GGT CCC CCT GAA CCT GTT ACC 390

Ile Val Trp Trp Arg Gly Thr Thr Lys Ile Gly Gly Pro

ATT GTC TGG TGG AGA GGA ACT ACG AAG ATC GGG GGA CCC 429

Ala Pro Ser Pro Ser Val Leu Asn Val Thr Gly Val Thr

GCT CCC TCT CCA TCT GTT TTA AAT GTA ACA GGG GTG ACC 468

Gln Ser Thr Met Phe Ser Cys Glu Ala His Asn Leu Lys

CAG AGC ACC ATG TTT TCC TGT GAA GCT CAC AAC CTA AAA 507

Gly Leu Ala Ser Ser Arg Thr Ala Thr Val His Leu Gln

GGC CTG GCC TCT TCT CGC ACA GCC ACT GTT CAC CTT CAA 546

Ala Leu Pro Ala Ala Pro Phe Asn Ile Thr Val Thr Lys

GCA CTG CCT GCA GCC CCC TTC AAC ATC ACC GTG ACA AAG 585

Leu Ser Ser Ser Asn Ala Ser Val Ala Trp Met Pro Gly

CTT TCC AGC AGC AAC GCT AGT GTG GCC TGG ATG CCA GGT 624

Ala Asp Gly Arg Ala Leu Leu Gln Ser Cys Thr Val Gln

GCT GAT GGC CGA GCT CTG CTA CAG TCC TGT ACA GTT CAG 663

Val Thr Gln Ala Pro Gly Gly Trp Glu Val Leu Ala Val

GTG ACA CAG GCC CCA GGA GGC TGG GAA GTC CTG GCT GTT 702

Val Val Pro Val Pro Pro Phe Thr Cys Leu Leu Arg Asp

GTG GTC CCT GTG CCC CCC TTT ACC TGC CTG CTC CGG GAC 741

Leu Val Pro Ala Thr Asn Tyr Ser Leu Arg Val Arg Cys

CTG GTG CCT GCC ACC AAC TAC AGC CTC AGG GTG CGC TGT 780

Ala Asn Ala Leu Gly Pro Ser Pro Tyr Ala Asp Trp Val

GCC AAT GCC TTG GGG CCC TCT CCC TAT GCT GAC TGG GTG 819

Pro Phe Gln Thr Lys Gly Leu Ala Pro Ala Ser Ala Pro

CCC TTT CAG ACC AAG GGT CTA GCC CCA GCC AGC GCT CCC 858

Gln Asn Leu His Ala Ile Arg Thr Asp Ser Gly Leu Ile

CAA AAC CTC CAT GCC ATC CGC ACA GAT TCA GGC CTC ATC 897

Leu Glu Trp Glu Glu Val Ile Pro Glu Ala Pro Leu Glu

TTG GAG TGG GAA GAA GTG ATC CCC GAG GCC CCT TTG GAA 936

Gly Pro Leu Gly Pro Tyr Lys Leu Ser Trp Val Gln Asp

GGC CCC CTG GGA CCC TAC AAA CTG TCC TGG GTT CAA GAC 975

Asn Gly Thr Gln Asp Glu Leu Thr Val Glu Gly Thr Arg

AAT GGA ACC CAG GAT GAG CTG ACA GTG GAG GGG ACC AGG 1014

Ala Asn Leu Thr Gly Trp Asp Pro Gln Lys Asp Leu Ile

GCC AAT TTG ACA GGC TGG GAT CCC CAA AAG GAC CTG ATC 1053

Val Arg Val Cys Val Ser Asn Ala Val Gly Cys Gly Pro

GTA CGT GTG TGC GTC TCC AAT GCA GTT GGC TGT GGA CCC 1092

Trp Ser Gln Pro Leu Val Val Ser Ser His Asp Arg Ala

TGG AGT CAG CCA CTG GTG GTC TCT TCT CAT GAC CGT GCA 1131

Gly Gln Gln Gly Pro Pro His Ser Arg

GGC CAG CAG GGC CCT CCT CAC AGC CGC 1158

CLAIMS:

1. A mammalian receptor tyrosine kinase which is a developmental tyrosine
kinase (Dtk) and which is expressed in multipotential haematopoietic cells, in
embryonic stem cells, in brain tissue and in testis, but which is not expressed in
mature lineage-restricted haematopoietic cells.

2. A receptor tyrosine kinase according to claim 1 that is murine Dtk having 
the amino acid sequence of SEQ ID NO 1, or a functional equivalent thereof. 


3. A receptor tyrosine kinase according to claim 1 that is mature murine Dtk 
having the amino acid sequence of SEQ ID NO 2. 

4. A receptor tyrosine kinase according to claim 1 that is human Dtk having the 
amino acid sequence of SEQ ID NO 3, or a functional equivalent thereof.

5. A receptor tyrosine kinase according to claim 1 that is mature human Dtk 
having the amino acid sequence of SEQ ID NO 4. 

6. An extracellular receptor domain of a receptor tyrosine kinase according to
claim 1.

7. An extracellular receptor domain which is the extracellular receptor domain 
of mature murine Dtk as defined in claim 3, or a functional equivalent thereof. 


8. An extracellular receptor domain of a receptor tyrosine kinase having the 
amino acid sequence of SEQ ID NO 5. 

9. An extracellular receptor domain which is the extracellular receptor domain 
of mature human Dtk as defined in claim 5, or a functional equivalent thereof.

10. An extracellular receptor domain of a receptor tyrosine kinase having the 
amino acid sequence of SEQ ID NO 6. 
11. An extracellular receptor domain according to any one of claims 6 to 10
which is bound or attached to a support. 

12. A soluble receptor comprising the extracellular receptor domain of a 
receptor tyrosine kinase according to any one of claims 1 to 5 lacking the 
transmembrane region and catalytic domain of said receptor tyrosine kinase. 

13. A nucleic acid molecule encoding a receptor tyrosine kinase as defined in 
claim 1. 

14. A nucleic acid molecule encoding murine Dtk or a functional equivalent 
thereof as defined in claim 2. 

15. A nucleic acid molecule according to claim 14 which is DNA.

16. A DNA molecule according to claim 15 having the nucleotide sequence of 
SEQ ID NO 7. 

17. A nucleic acid molecule encoding mature murine Dtk as defined in claim 3. 

18. A nucleic acid molecule according to claim 17 which is DNA.

19. A DNA molecule according to claim 18 having the nucleotide sequence of 
SEQ ID NO 8. 

20. A nucleic acid molecule encoding human Dtk or a functional equivalent 
thereof as defined in claim 4. 

21. A nucleic acid molecule according to claim 20 which is DNA.

22. A DNA molecule according to claim 21 having the nucleotide sequence of 
SEQ ID NO 9. 



23. A nucleic acid molecule encoding mature human Dtk as defined in claim 5.

24. A nucleic acid molecule according to claim 23 which is DNA. 

25. A DNA molecule according to claim 24 having the nucleotide sequence of 
SEQ ID NO 10. 

26. A nucleic acid molecule encoding an extracellular receptor domain as 
defined in claim 6. 

27. A nucleic acid molecule encoding the extracellular receptor domain of 
murine Dtk or a functional equivalent thereof as defined in claim 7. 

28. A nucleic acid molecule according to claim 27 which is DNA. 

29. A DNA molecule according to claim 28 having the nucleotide sequence of 
SEQ ID NO 11. 



30. A nucleic acid molecule encoding the extracellular receptor domain of 
human Dtk or a functional equivalent thereof as defined in claim 9.

31. A nucleic acid molecule according to claim 30 which is DNA.

32. A DNA molecule according to claim 31 having the nucleotide sequence of 
SEQ ID NO 12.

33. A vector including a DNA molecule as defined in claim 13. 

34. A vector including a DNA molecule as defined in any one of claims 15, 16, 
18 and 19.

35. A vector including a DNA molecule as defined in any one of claims 21, 22, 
24 and 25. 
36. A vector including a DNA molecule as defined in claim 28 or claim 29.

37. A vector including a DNA molecule as defined in claim 31 or claim 32. 



38. A method of producing a receptor tyrosine kinase comprising the steps of: 

(a) culturing a host cell which has been transformed or transfected with a 
vector as claimed in any one of claims 33-35 to express the encoded 
receptor tyrosine kinase; and 

(b) recovering the expressed receptor tyrosine kinase. 



39. A method of producing an extracellular receptor domain of a receptor 
tyrosine kinase comprising the steps of: 

(a) culturing a host cell which has been transformed or transfected with a 
vector as claimed in claim 36 or claim 37 to express the encoded 
extracellular receptor domain; and 

(b) recovering the expressed extracellular receptor domain. 

40. A recombinant receptor tyrosine kinase which is the product of a method as
defined in claim 38.



41. A recombinant extracellular receptor domain which is the product of a 
method as defined in claim 39. 

42. A ligand that binds to a receptor tyrosine kinase as defined in claim 1. 

43. A ligand that binds to a receptor tyrosine kinase as defined in claim 2. 

44. A ligand that binds to a receptor tyrosine kinase as defined in claim 3. 

45. A ligand that binds to a receptor tyrosine kinase as defined in claim 4. 

46. A ligand that binds to a receptor tyrosine kinase as defined in claim 5. 
47. A ligand that binds to an extracellular receptor domain of a receptor tyrosine
kinase as defined in claim 6. 

48. A ligand that binds to an extracellular receptor domain of a receptor tyrosine 
kinase as defined in claim 7. 

49. A ligand that binds to an extracellular receptor domain of a receptor tyrosine 
kinase as defined in claim 8. 

50. A ligand that binds to an extracellular receptor domain of a receptor tyrosine 
kinase as defined in claim 9. 

51. A ligand that binds to an extracellular receptor domain of a receptor tyrosine 
kinase as claimed in claim 10. 

52. A ligand that binds to an extracellular receptor domain of a receptor tyrosine 
kinase as claimed in claim 11. 

53. A ligand that binds to a soluble receptor as defined in claim 12. 

54. A ligand that binds to a receptor tyrosine kinase as claimed in claim 40. 

55. A ligand that binds to an extracellular receptor domain as claimed in claim 
41. 

56. A ligand according to any one of claims 42-55 wherein the ligand stimulates 
the proliferation, differentiation and/or survival of cells which express a receptor 
tyrosine kinase according to claim 1. 

57. A ligand according to any one of claims 42-55 wherein the ligand is 
antagonistic and at least partially blocks or inhibits the function of a receptor 
tyrosine kinase according to claim 1 through binding to said receptor. 
58. A method of stimulating the proliferation, differentiation and/or survival of a
cell expressing a receptor tyrosine kinase according to claim 1 comprising 
contacting the cell with a ligand according to claim 56. 

59. A method according to claim 58 wherein the stimulation occurs in vivo. 

60. A method according to claim 58 wherein the stimulation occurs ex vivo. 

61. A method of inhibiting the function of a receptor tyrosine kinase according 
to claim 1 comprising contacting the receptor with a ligand according to claim 57. 

62. A method according to claim 61 wherein the inhibition occurs in vivo. 

63. A method according to claim 61 wherein the inhibition occurs ex vivo. 

64. A method of treating a disease, syndrome or condition caused or mediated 
by an excess of a ligand as claimed in claim 56 comprising the step of contacting 
said excess of said ligand with an effective amount of a receptor tyrosine kinase 
according to any one of claims 1-5 and 40, an extracellular receptor domain 
according to any one of claims 6-11 and 41 or a soluble receptor according to 
claim 12. 

65. A method of treating a disease, syndrome or condition caused or mediated 
by an excess of a ligand as defined in claim 57 comprising the step of contacting 
said excess of said ligand with an effective amount of a receptor tyrosine kinase 
according to any one of claims 1-5 and 40, an extracellular receptor domain 
according to any one of claims 6-11 and 41 or a soluble receptor according to 
claim 12. 

66. A method of extracting a ligand as defined in claim 56 or claim 57 from a 
medium which may contain said ligand comprising the step of contacting said 
medium with a receptor tyrosine kinase according to any one of claims 1-5 and 40, 
an extracellular receptor domain according to any one of claims 6-11 and 41 or a
soluble receptor according to claim 12. 

67. A method of isolating ligand(s) as defined in claim 56 or claim 57 from a 
medium which may contain said ligand(s), comprising the steps of:

(a) contacting said medium with an effective amount of a receptor tyrosine 
kinase according to any one of claims 1-5 and 40, an extracellular 
domain according to any one of claims 6-11 and 41 or a soluble 
receptor according to claim 12; 
(b) detecting which ligand(s) bind to said tyrosine kinase receptor,
extracellular receptor domain or soluble receptor; and

(c) isolating such bound ligand(s). 

68. A ligand which is isolated by a method according to claim 67. 